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CHIMERIC GFP-AEQUOR1N AS BIOLUMINESCENT 
Ca^ REPORTERS AT THE SINGLE CELL LEVEL 



CROSS-REFERENCE TO RELATED APPLICATIONS 

[001] This application is based on and claims the benefit of U.S. Provisional 
Application Nos. 60/208,314, filed June 1, 2000 (Attorney Docket No. 03495.6051), 
60/210,526, filed June 6, 2000 (Attorney Docket No. 03495.6052), and 60/255,1 11, filed 
December 14, 2000 (Attorney Docket No. 03495.6059). The entire disclosure of each of 
these applications is relied upon and incorporated by reference herein. 

BACKGROUND OF THE INVENTION 

[002] This invention relates to a modified bioluminescent system comprising a 
flourescent molecule covalently linked with a photoprotein allowing the transfer of 
energy by Chemiluminescence Resonance Energy Transfer (CRET). This invention also 
relates to the use of the modified bioluminescent system in in vivo and in vitro assays. 

[003] Calciufri is implicated in the regulation of a great variety of intracellular 
processes (1). Several techniques are most commonly used for intracellular Ca"*" 1 " 
monitoring. Patch-clamp and Ca"^ selective microelectrodes give cumulative 
measurements of Ca** fluxes in a restricted number of cells. On the other hand, 
intracellular [Ca"^] dynamics in large populations of cells can be visualized with 
fluorescent probes (2). Genetic tools could provide new methods for Ca""" monitoring. 

[004] Two groups of genetic Ca** probes are at present available. The first 
category uses the principle of Fluorescence Resonance Energy Transfer (FRET) between 
two variants of the green fluorescent protein (GFP). The two GFP are covalently linked 
by a calmodulin binding sequence alone or in combination with calmodulin so that 
intramolecular FRET does (3) or does not (4) occur in response to Ca"^ influx. The 
second category is composed by bioluminescent proteins, such as aequorin (5, 6). The 
active protein is formed in the presence of molecular oxygen from apoaequorin (189 
amino acids) and its luciferin, coelenterazine (Mr 423) (7). 

[005] The binding of Ca 4 ^ to aequorin, which has three EF-hand structures 
characteristic of Ca^ binding sites, induces a conformational change resulting in the 
oxidation of coelenterazine via an intramolecular reaction. Moreover, the 
coelenteramide so produced is in an excited state, and blue light (max: 470nm) is 
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emitted when it returns to its ground state (8). Such a bioluminescent genetic marker 
presents the advantage over Ca^-sensitive fluorescent dyes of being easily targeted to 
specific cells and in subcellular compartments with appropriate regulatory elements and 
peptide signals (9). The bioluminescent process does not require light excitation like 
fluorescent probes or proteins, and thus does not induce autofluorescence, 
photobleaching and biological degradation problems. Furthermore, aequorin is not 
toxic, does not bind other divalent cations and does not interfere with the [Ca 4 ^ buffer 
system even when microinjected at high concentrations. Its low affinity for Ca"^ (Kd = 
10 (jaM) is probably responsible for this and makes aequorin a good sensor in the range 
of biological [Oa* 4 ] variations. 

[006] Although providing a good ratio of signal over background, aequorin 
signals are very difficult to detect because of aequorin's low light quantum yield, that is, 
the number of emitted photons per protein that bind Ca^. In the jellyfish, Aequorea 
victoria, from which aequorin has been isolated (10), the protein is associated with the 
GFP (11). After Ca"^ binding, the energy acquired by aequorin is transferred from the 
activated oxyluciferin to GFP without emission of blue light. The GFP acceptor 
fluorophore is excited by the oxycoelenterazine through a radiationless energy transfer. 
Then, a green light (max, 509nm) is emitted when the excited GFP returns to its ground 
state (12). 

[007] Such intermolecular radiationless energy transfer is not unusual in 
bioluminescence and has already been shown to increase the quantum yield of the 
bioluminescent process in Renilla, another coelenterate (13). The gain measured in vitro 
ranges from 3 to 5 fold (14). It is possible to reconstitute in vitro the Renilla system and 
obtain the spectral shift with low equimolar concentrations of its components because 
the luciferase and the green fluorescent protein bind together (14). 

[008] In the Aequorea system, binding between purified photoprotein and GFP 
does not occur in solution, even when present at high concentrations (15). In vivo, 
energy transfer occurs because of the high concentration of GFP. It can be obtained in 
vitro through the co-adsorption of aequorin and GFP on DEAE cellulose membranes 
(15). The Forster equation shows that the efficiency of this process depends on several 
conditions described in the case of FRET. The emission spectrum of the donor must 
have the greatest overlap with the excitation spectrum of the acceptor. The energy 
transferred is also strongly dependent on the geometry, in particular, the relative 
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orientation and distance of the two dipoles and modulated by their respective motion 
(16). 

[009] An aim of this invention is to develop a dual reporter gene combining 
properties of Ca"^ -sensitivity and fluorescence of aequorin and GFP, respectively. The 
fusion protein, which can be detected with classical epifluorescence, can be used to 
monitor calcium activities. The configuration of the molecules of the invention 
increases their overall turnover and allows an efficient intramolecular 
Chemiluminescence Resonance Energy Transfer (CRET). As a result, the quantum 
yield of aequorin appears to be higher. This invention shows that physiological calcium 
signals can be visualized in single eukaryotic cells with an intensified CCD camera. 
Other constructs described here target the fusion protein to the neurite membrane. 

SUMMARY OF THE INVENTION 

[0 1 0] This invention thus provides a modified bioluminescent system 
comprising a fluorescent molecule covalently linked with a photoprotein, wherein the 
link between the two proteins has the function to stabilize the modified bioluminescent 
system and allow the transfer of the energy by Chemiluminescence Resonance Energy 
Transfer (CRET). In a preferred embodiment, the bioluminescent system comprises a 
GFP protein covalently linked to an aequorin protein, wherein the link between the two 
proteins has the function to stabilize the modified bioluminescent system and to allow 
the transfer of the energy by Chemiluminescence Resonance Energy Transfer (CRET). 

[011] In one embodiment of a modified bioluminescent system according to the 
invention, the bioluminescent system comprises a GFP protein covalently linked to an 
aequorin protein, wherein the link between the two proteins is constituted by at least 5 
amino acids and optionally at least 5 amino acids and at least one copy of 9 amino acids. 
The link has the function of stabilizing the system and allowing the transfer of energy by 
Chemiluminescence Resonance Energy Transfer (CRET). 

[012] In a preferred embodiment, the bioluminescent system comprises a GFP 
protein covalently linked to an aequorin protein, wherein the link between the two 
proteins is preferably constituted by at least 5 amino acids and five copies of 9 amino 
acids and has the function of stabilizing the system and allowing the transfer of energy 
by Chemiluminescence Resonance Energy Transfer (CRET). 

[013] The two proteins can be separate or together functional. In addition, the 
modified bioluminescent system can be calcium sensitive and/or light sensitive. 
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[014] This invention also provides a method of screening in vitro a change in a 
physical, chemical, biochemical, or biological condition. The method comprises: 

a) providing in different samples a bioluminescent system according to the 
invention in a reaction system containing an analyte of interest; 

b) measuring whether light is produced; and 

c) detecting a change based on the production of light. 

[015] Further, this invention provides a method of screening in vivo a change in 
a physical, chemical, biochemical, or biological condition. The method comprises the 
steps of: 

a) administering to a mammal an acceptable composition comprising a 
bioluminescent system according to the invention; 

b) detecting whether light is produced; and 

c) optionally measuring ionic concentration of calcium flux. 

[016] In addition, this invention provides a composition comprising a purified 
polypeptide, wherein the composition has the functional characteristics of binding 
calcium ions and permitting measureable energy, said energy depending of the quantity 
of calcium bound and of the quantity of polypeptides in said composition in absence of 
any light excitation. 

[017] In addition, this invention provides a purified polypeptide having the 
amino acid sequence of SEQ ID NO: 1; SEQ ID NO: 2; SEQ ID NO: 3; SEQ ID NO: 
4;SEQIDNO: 5; and SEQ ID NO: 6. 

[018] In other embodiments, this invention provides a polynucleotide having 
the sequence of SEQ ID NO: 7; SEQ ID NO: 8; SEQ ID NO: 9; SEQ ID NO: 10; 
SEQ ID NO: 11; and SEQ ID NO: 12. 

[019] This invention also provides a culture as deposited at the C.N. CM. and 
containing the plasmid No. 1-2507; the plasmid No. 1-2508; the plasmid No. 1-2509; the 
plasmid No. 1-2510; the plasmid No. 1-251 1; the plasmid No. 1-2512; or the plasmid No. 
1-2513. 

[020] Further, this invention provides a peptide linker having the function after 
translation to approach a donor site to an acceptor site in optimal conditions to permit a 
direct transfer of energy by chemiluminescence in a purified polypeptide according to 
the invention. The nucleotide linker can have, for example, the nucleotide sequence of 
SEQ ID No: 13; SEQ ID No: 14; SEQ ID No: 15; SEQ ID No: 16, or SEQ ID No: 
17. The peptide linker can comprise at least 5 amino acids and comprising the amino 
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acid sequence of SEQ ID No: 18; SEQ ID No: 19; SEQ ID No: 20; SEQ ID No: 21, or 
SEQIDNo: 22. 

[021] A kit for measuring the transfer of energy in vivo or in vitro contains at 
least one of the polypeptides according to the invention or the polynucleotide according 
to the invention and the reagents necessary for visualizing or detecting the said transfer 
in presence or in absence of a molecule of interest. 

[022] In another embodiment, the invention provides a fusion protein of the 
formula: 

GFP - LINKER - AEQ; 

[023] wherein GFP is green fluorescent protein; AEQ is aequorin; and LINKER 
is a polypeptide of 4-63 amino acids, preferably 14-50 amino acids. 

[024] The LINKER can comprise the following amino acids: 

[025] (Gly Gly Ser Gly Ser Gly Gly Gin Ser [SEQ ID NO: 25]) n , wherein n is 
1-5. Preferably n is 1 or n is 5. LINKER can also include the amino acid sequence Ser 
Gly Leu Arg Ser [SEQ ID NO: 26]. 

[026] Another fusion protein for energy transfer from aequorin to green 
fluorescent protein by Chemiluminescence Resonance Energy Transfer (CRET) 
following activation of the aequorin in the presence of Ca 4 "* has the formula: 
GFP - LINKER - AEQ; 

[027] wherein GFP is green fluorescent protein; AEQ is aequorin; and LINKER 
comprises the following amino acids: 

[028] (Gly Gly Ser Gly Ser Gly Gly Gin Ser [SEQ ID NO: 25]) n , wherein n is 
1-5; and wherein the fusion protein has an affinity for Ca""" ions and a half-life of at least 
24 hours. The LINKER can include the amino acid sequence Ser Gly Leu Arg Ser [SEQ 
ED NO: 26], In addition, the fusion protein can further comprise a peptide signal 
sequence for targeting the fusion protein to a cell or to a subcellular compartment. 

[029] This invention also provides polynucleotides encoding fusion proteins as 
described above. 

BRIEF DESCRIPTION OF THE PR AWTNCS 

[030] This invention will be described with reference to the drawings in which: 
[031] Fig. 1 is a schematic map of different constructions. All the constructs 
were under the control of the human cytomegalovirus promoter (PCMV). An asterisk 
indicates the position of a Val-163-Ala mutation. In pGA, the coding sequences of GFP 
and aequorin are separated by 5 codons. One to five linkers (in brackets) have been 
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added in pGiA where i is the number of linker. Linkers were oriented so as to encode a 
9 amino acid repeat. Complete Synaptotagmin 1 or its transmembrane part (tSyn) were 
fused in frame with the G5 A. 

[032] Fig. 2 depicts Ca^ CRET activities on cellular extracts. Emission spectra 
of aequorin and several GFP-Aequorin fusion proteins were calibrated as a percentage of 
maximum intensity. CRET measurements are expressed as the ratio of green (500nm) 
over blue (450nm) photons. 

[033] Fig. 3 depicts GFP fluorescence of GFP-Apoaequorin proteins in 
Neuro2A cells transfected with pGm (A), pGA (B), pG2A (C), and pG5 A (D). Confocal 
superposition of GFP fluorescence and immunostaining of synaptotagmin in cells 
expressing either pSG5A (E) or pStG5A (F) is shown. 

[034] Fig. 4 depicts Ca^-induced bioluminescence detected at the single cell 
level Neuro2A cells transfected with pGA (A. 1-4) or pSG5A (B) were pre-incubated 
with 5jliM coelenterazine in a Ca^-free buffer. (A. 3) GFP Fluorescence made it 
possible to choose transfected cells. The background recorded before CaCl 2 addition 
(A.2) corresponds to the relative light unit (RLU) -level at time 0 of experiment (A.4, B). 
Intensities of fluorescence and bioluminescence activity are translated in scaled 
pseudocolors. Representative pictures of the chosen field are shown after addition of 
5mM CaCl 2 and 5|liM A23 187 at 13 sec. and 159 sec, respectively, after the beginning of 
the acquisition (A. 1). (A.4) Each profile indicates the intensity of light emitted by a 
single cell. 

[035] Five regions of interest were defined by encircling individual cell soma. 
With pGA (data not shown) or pSG5A (B) transfection, a high concentration of CaCl 2 , 
(1 OOmM) was added at the end of the experiment (500sec.) to check that the 
bioluminescent protein was still active. (C) Control experiments were made with Fluo-3 
AM on mock-transfected Neuro2A cells. 

[036] Fig. 5 depicts the results of analysis of protein stability for various fusion 
proteins. 

[037] Fig. 6 depicts the results of the determination of the Ca^ affinity of 
aequorin and fusion protein G5 A. 

[038] Fig. 7 depicts the calibration curves between the bioluminescent activity 
and Ca2+, for G5A, SG5A, and aequorin. 

[039] Fig. 8 shows fluorescence and Ca2+-induced bioluminescent activity in 
dissociated neurons in culture infected with adenoviral-G5A vectors. 
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[040] Fig. 9 shows fluorescence and Ca2+-induced bioluminescent activities in 
dissociated neurons in culture infected with adenoviral-SG5A vectors. 

[041] Fig. 10 shows representative pattern of luminescence activity after 
injection of GA plasmid at the one cell stage of Xenopus embryo. 

[042] Fig. 1 1 shows a transgenic Xenopus larva with GFP-aequorin. 
DETAILED DESCRIPTION OF THE INVENTION 

[043] Among the coelenterates, bioluminescent species exist. Numerous 
studies have shown that the bioluminescence is generated by photoproteins that are 
sensitive to calcium. These proteins emit a flash of light in response to an increase in 
the concentration of calcium ions. Among these photoproteins, aequorin is one of the 
most well studied (Blinks et aL, 1976). 

[044] Isolated in the jellyfish, Aequoria victoria (Shimomura et aL, 1962), 
aequorin, after binding with three calcium ions, emits a flash of blue light with a 
spectrum of maximum wavelength 470 nm. Contrary to a classical luciferase-luciferin 
reaction, the emission of light does not require oxygen, and the total amount of light is 
proportional to the amount of protein. Oxygen is necessary, however, to reconstitute the 
aequorin, by the action of apoaequorin, a protein with a molecular mass of 21kDa, and 
coelenterazine. The emission of photons is caused by a peroxidation reaction in the 
coelenterazine, after binding with the three calcium ions on the aequorin. Two 
hypotheses have been suggested for this process: (i) the binding between aequorin and 
calcium ions induces the emission of light by a conformational change in the protein, 
allowing oxygen to react with coelenterazine, and (ii) oxygen plays a role in the binding 
between coelenterazine and apoaequorin (Shimomura and Johnson, 1978). Aequorin 
may be recreated in vitro and in vivo by eliminating oxyluciferin, adding luciferin 
(coelenterazine) in the presence of p-mercaptoethanol and oxygen (Shimomura and 
Johnson, 1978). The necessity of adding p-mercaptoethanol or a reducing agent to 
reconstitute aequorin is presumably due to the presence of at least one sulfhydryl group 
of cysteine 145 included in a negatively charged microenvironment (Charbonneau et aL, 
1985). 

[045] More than thirty semi-synthetic aequorins having different affinities for 
calcium ions have been characterized, based on the type of coelenterazine that binds to 
the protein (Shimomura, 1991; incorporated by reference herein). The dissociation 
constant between aequorin and the calcium ions is estimated to be between 0. 1 mM 
(Aliens a/., 1997) and 1 mM (Prasher et aL, 1985). Although the relationship between 
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light emission and calcium ion concentration may not be linear, a logarithmic 
relationship between the emission of light and the calcium ion concentration has 
nonetheless been determined (Johnson and Shimomura, 1978). Indeed, a 200-fold 
increase in the signal to background noise ratio is measured when the Ca""" concentration 
goes from 10~ 7 M to lO^M , and by a factor of 1000, from lO^M to 10' 5 M (Cobbold and 
Rink, 1987). Moreover, the kinetics of the signal emission is rapid enough to detect 
transitory increases in Ca"^ ion concentrations. An increase in light intensity with a time 
constant of 6 msec, under calcium saturation conditions, has been shown (Blinks et ah, 
1978). Aequorin is thus a photoprotein that is well adapted to measure rapid and 
elevated increases in Ca** ions under physiological conditions. 

[046] The cloning of the apoaequorin gene by Prasher et ah, (1985) and Inouye 
et ah (1985) has led to the creation of expression vectors, making possible its targeting 
in a specific cell compartment by fusion with nuclear, cytoplasmic, mitochondrial, 
endoplasmic reticulum, or plasma membrane signal peptides (Kendall et ah, 1992; Di 
Giorgio et ah, 1996). In addition, the in vivo expression of the protein makes possible 
its detection at low levels, leaving the intracellular physiology of calcium undisturbed. 

[047] In nature, photoprotein activity is very often linked to a second protein. 
The most common is the "green flourescent protein" or GFP. The light emitted in this 
case is in fact green. The hypothesis of an energy transfer between aequorin and GFP 
by a radiative mechanism was proposed in the 1960s by Johnson et ah, (1962). The blue 
light emitted by aequorin in the presence of Ca"^ is presumably absorbed by GFP and 
reemitted with a spectrum having a maximum wave length of 509 nm. Other studies 
have shown that this transfer of energy occurs through a non-radiative mechanism made 
possible through the formation of heterotetramer between GFP and aequorin. Morise et 
ah (1974) have succeeded in visualizing this energy transfer in vitro, and a co- 
adsorption of the two molecules on a DEAE-cellulose membrane facilitates the process. 
Through this mechanism, it thus appears possible to increase the quantum efficiency of 
the system (Ward and Cormier, 1976). 

[048] GFP, also isolated in the jelly fish Aequoria victoria, was recently cloned 
(Prasher et ah, 1992). It has been used in different biological systems as a cellular 
expression and lineage marker (Cubitt et ah, 1995). Detecting this protein using 
classical fluorescence microscopy is relatively easy to do in both living organisms and 
fixed tissue. In addition, fluorescent emission does not require the addition of a cofactor 
or coenzyme and depends on an autocatalytic post-translational process. The 
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fluorophore, consisting of nine amino acids, is characterized by the formation of a cycle 
between serine 65 and glycine 67, which gives rise to an intermediate imidazolidine 5, 
followed by oxidation of tyrosine 66, transforming it into dehydrotyrosine (Heim et al., 
1994). This group is found inside a cylinder composed of 1 1 p layers, which constitutes 
an environment that interacts directly with the chromophore (Yang et ah, 1996). 

[049] Monitoring calcium fluxes in real time could help to understand the 
development, the plasticity, and the functioning of the central nervous system. In 
jellyfish, the chemiluminescent, calcium binding, aequorin protein is associated with the 
green fluorescent protein (GFP), and a green bioluminescent signal is emitted upon Ca** 
stimulation. Aequorin alone is difficult to detect on the cellular and subcellular level 
owing to the weak emission of photons after excitation. 

[050] The development of a new marker sensitive to calcium with a higher 
quantum yield was therefore initiated. This invention utilizes Chemiluminescence 
Resonance Energy Transfer (CRET) between the two molecules. Calcium sensitive 
bioluminescent reporter genes have been constructed by fusing GFP and aequorin 
resulting in much more light being emitted. 

[05 1] Chemiluminescent and fluorescent activities of these fusion proteins have 
been assessed in mammalian cells. Cystosolic Ca^ increases were imaged at the single 
cell level with a cooled intensified CCD (coupled charge device) camera. This 
bifunctional reporter gene should allow the investigation of calcium activities in 
neuronal networks and in specific subcellular compartments in transgenic animals. 

[052] GFP-aequorin Fusion Proteins as Ca^-Activated Reporter Genes. 

[053] According to this invention, a fusion protein has been constructed with 
aequorin and GFP to increase the quantum yield of Ca^-induced bioluminescence. This 
activity can not be increased simply by co-expressing GFP with aequorin (data not 
shown). A thermoresistant GFP (Gm) was fused in frame with the NH 2 terminal region 
of apoaequorin (Fig. 1), since the C-terminal proline residue has been shown to be 
implicated in the Ca^-activated bioluminescent process (20). 

[054] Different constructs have been made with increasing size of linker 
between GFP and apoaequorin. The shortest spacer is formed by 5 amino acids and the 
longest by 50 amino acids (Fig. 1). All the fusion proteins showed a better Ca" 1 ^- 
triggered bioluminescent activity than aequorin alone. The increases of light emitting 
activity ranged from 19 to 65 times (Table 1) possibly because of greater protein 
stability. 
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TABLE 1 

CA++ INDUCED CHEMILUMINESCENCE ACTIVITIES 



Name 


Mean ± SEM* 




RLUx 10 6 / lOUpgal 


pA 


0.15 (0.10; 021) 


pGa 


10.01 ±4.4 


pGIA 


2.96 (3.39; 2.53) 


pG2A 


8.39 (9.54; 7.23) 


pG4A 


7.78 (12.02; 3.53) 


pG5A 


8.15 ± 1.72 



*SEM is indicated when more than two measures were made. 
Otherwise the two measures are given. 



[055] The plasmids identified in Table 1 are described in detail hereafter. The 
following sequence identifiers are used to describe the amino acid and nucleotide 
sequences of each plasmid insert. 



TABLE 2 
SEQUENCE IDENTIFIERS 



Plasmid Insert 


Amino Acid Sequence 


Nucleotide Sequence 


A 


★ 


* 


GA 


SEQ ID NO: 1 


SEQ ID NO: 7 


GIA 


SEQ ID NO: 2 


SEQ ID NO: 8 


G2A 


SEQ ID NO: 3 


SEQ ID NO: 9 


G4A 


SEQ ID NO: 4 


SEQ ID NO: 10 


G5A 


SEQ ID NO: 5 


SEQ ID NO: 1 1 


SeG5A 


SEQ ID NO: 6 


12 



*The nucleotide sequence of apoaequorin is contained in U.S. 5,422,266. 



[056] The identity of the linker used in these constructs is as follows: 
DNA sequence of GFP-aequorin linker 

pGA (strain 1-2507) TCC GGC CTC AGA TCT [SEQ ID NO: 13] 

pGIA (strain 1-2508) TCC GGC GGG AGC GGA TCC GGC GGC CAG TCC 

GGC CTC AGA TCT [SEQ ID NO: 14] 

pG2A (strain 1-2509) TCC GGC GGG AGC GGA TCC GGC GGC CAG TCC 

GGC GGG AGC GGA TCC GGC GGC CAG TCC GGC CTC 
AGA TCT [SEQ ID NO: 15] 

pG4A (strain 1-2510) TCC GGC GGG AGC GGA TCC GGC GGC CAG TCC 

GGC GGG AGC GGA TCC GGC GGC CAG TCC GGC GGG 
AGC GGA TCC GGC GGC CAG TCC GGC GGG AGC GGA ' 
TCC GGC GGC CAG TCC GGC CTC AGA TCT 
[SEQ ID NO: 16] 
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pG5A (strain 1-2511) TCC GGC GGG AGC GGA TCC GGC GGC CAG TCC 

GGC GGG AGC GGA TCC GGC GGC CAG TCC GGC GGG 
AGC GGA TCC GGC GGC CAG TCC GGC GGG AGC GGA 
TCC GGC GGC CAG TCC GGC GGG AGC GGA TCC GGC 
GGC CAG TCC GGC CTC AGA TCT [SEQ ID NO: 17] 

pSeG5A (strain -1-2512) and pStGSA (strain 1-2513) same linker 
sequence as pG5A. 



Peptide sequence of linker 



pGA 


Ser 


Gly 


Leu 


Arg 


Ser 


[SEQ ID 


NO: 


18] 












pGIA 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Leu 


Arg 


Ser 


[SEQ 


ID NO: 


19] 






















pG2A 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 




Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Leu 


Arg 


Ser 


[SEQ ID 


NO: 


20] 


pG4A 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 




Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 




Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Leu 


Arg 


Ser 


[SEQ 




ID NO: 21] 






















pG5A 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 




Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 




Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 




Gly 


Gly 


Gin 


Ser 


Gly 


Leu 


Arg 


Ser 


[SEQ ID 


NO: 


22] 







pSeG5A and pStG5A idem than pGSA 



[057] Plasmids containing the foregoing polynucleotides have been deposited at 
the Collection Nationale de Cultures de Microorganismes ("C.N. CM."), Institut Pasteur, 
28, rue du Docteur Roux, 75724 Paris Cedex 15, France, as follows: 



Insert 


Plasmid 


Accession No. 


Deposit Date 


A 


pAeq+ 


1-2506 


June 22, 2000 


GA 


pGa 


1-2507 


June 22, 2000 


G1A 


pGIA 


1-2508 


June 22, 2000 


G2A 


pG2A 


1-2509 


June 22, 2000 


G4A 


pG4A 


1-2510 


June 22, 2000 


G5A 


pG5A 


1-2511 


June 22, 2000 


SeG5A 


pSeG5A 


1-2512 


June 22, 2000 


StG5A 


pStG5A 


1-2513 


June 22, 2000 



[058] Recombinant apoaequorin is unstable within the cytosol, with a half-life 
of approximately 20 minutes (21). In contrast, GFP is a very stable protein and probably 
stabilizes apoaequorin in the chimeric proteins. The turnover times of the different 
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cytosolic proteins were estimated on transient expression in COS 7 cells by treatment 
with puromycin (50/^g/ml) for 6 hours. Over this period, most fusion proteins presented 
a 30% decrease of activity compared with the 80% loss of apoaequorin when alone 
(Figure 5). It has been observed that, in vitro, the fusion proteins of the invention were 
more sensitive than aequorin alone. G5A gives a significant signal over background 
with Ca"^ concentration as low as 38 nM, whereas aequorin needs 28 times more 
calcium (l|iM) to yield a comparable signal (Figure 6). Energy transfer may also 
improve the quantum yield of GFP-aequorin allowing a more efficient calcium ions 
detection. To discriminate among the factors contributing to the higher light emission, it 
will be necessary to study the relaxation mechanisms of the GFP fluorescent excited 
state on purified hybrid proteins. 

[059] More generally, one embodiment of this invention provides a chimeric 
protein starting with the genes for GFP and aequorin. Improved quantum yield will 
depend on the functional coupling of the proteins by a Chemiluminescence Resonance 
Energy Transfer (CRET) mechanism. Thus, after the reconstitution of aequorin and its 
binding with calcium ions, the activated aequorin transmits its energy to the GFP, which 
in turn emits a green light to return to its ground state. Optimizing the functional 
coupling between the two proteins has focused on three points: 

1. Improving the induction of a conformational change in the GFP at 37°C, 
which leads to a higher emission of GFP in the mammalian cells; 

2. Changing to the use of aequorin codons adapted to mammalian cells to 
enhance its expression; and 

3. Adding a linkage peptide between the two proteins. 

[060] With respect to the third point, an initial molecular construct with five 
amino acids separating the two proteins was first completed. Then a sequence of nine 
amino acids was added in a sequence of one to five copies. These constructs were 
placed in a eukaryote expression vector under control of the CMV (cytomegalovirus) 
promoter allowing their functional ability to be confirmed. These fusion proteins may 
be identified: (i) by the GFP signal, through excitation of the biological preparations 
with light of wavelength 470 nm, by fluorescence microscopy (FITC filter); (ii) by 
aequorin activity, through emission of blue light after binding with Ca^ ions. 

[061] The following terms have the following meanings when used herein: 
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Luminescence 

[062] Emission of an electromagnetic radiation from an atom or molecule in 
UV, in visible or IR. This emission results from the transition from an electronically 
excited state towards a state from weaker energy, generally the ground state. 
Fluorescence 

[063] Fluorescence produced by a singlet, very short, excited electronically. 
This luminescence disappears at the same time as the source from excitation. 
Chemiluminescence 

[064] Luminescence resulting from a chemical reaction. 
Bioluminescence 

[065] Visible chemiluminescence, produced by living organisms. The 
invention mimics the system naturally present in the jellyfish, without fixation to a 
support. 

Bioluminescent system 

[066] The bioluminescent system according to the invention is a chimeric 
tripartite molecule within the middle a peptide linker and a coenzyme (i.e., 
coelenterazine). The first molecule and the second molecule covalently attached with 
the linker can be everything if they have for the first a donor site and for the second an 
acceptor site attached on it (receptors-linker-ligand, antibody-linker antigen). The 
chimeric protein can be fused to a fragment of tetanus toxin for its retrograde and 
transynaptic transport on axon by Coen, L., Osta, R., Maury, M., and Brulet, P., 
Construction of hybrid proteins that migrate retrogradely and transynaptically into the 
central nervous system. Proc. Natl. Acad. Sci. (USA) 94 (1997) 9400-9405, or fused to 
a membrane receptor. 
Non-radiative 

[067] No emission of photon from aequorin to the GTP when aequorin is 
bounded by calcium ions (therefore there is no transmission of blue light by aequorin in 
the invention, the energy transfer is directly made between the two proteins). 
FRET system 

[068] Transfer of energy by resonance by fluorescence (i.e., between two 
variants of GFP). 

References 

Fluorescent indicators for Ca2+ based on green fluorescent proteins and 
calmodulin. 
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Miyawaki, A., Llopis, J., Heim, R, McCaffery, J.M., Adams, J.A., Ikura, M. and 
Tsien, R. Y. Nature, (1997) Vol. 388 pp. 882-887. 

Detection in living cells of Ca2+-dependent changes in the fluorescence emission 
of an indicator composed of two green fluorescent protein variants linked by a 
calmodulin-binding sequence. A new class of fluorescent indicators. 
Romoser, V.A., Hinkle, P.M. and Persechini, A., J. Biol. Chem., (1997) Vol. 
272, pp. 13270-13274. 

CRET 

[069] Transfer of energy by resonance by chemiluminescence (i.e., fusion 
protein with GFP-aequorin (jellyfish Aequorea) but without linker or GFP-obeline). 
Reference: 

Chemiluminescence energy transfer. 

Campbell, A.K., in Chemiluminescence: Principles and application in Biology 
and Medicine, Eds Ellis Horwood, Chichester, UK 1988, pp. 475-534. 

BRET 

[070] Transfer of energy by resonance by bioluminescence (i.e. , interaction 
between GFP and luciferase (jellyfish Renilla). 
Reference: 

A bioluminescence resonance energy transfer (BRET) system: application to 
interacting circadian clock protein. 

Xu, Y., Piston, D.W. and Johnson, C.H. Proc. Natl. Acad. Sci., (USA) (1999) 
Vol. 96, pp. 151-156. 

Application 1: Study of calcium signals from a cell population in a eukaryotic organism. 

[071] Targeting the bioluminiscent protein sensitive to calcium in a cell 
population or in a specific tissue may be achieved through homologous recombination or 
by transgenesis under the control of a specific promoter. Replacing genes by 
homologous recombination in embryonic cells in mice, such, as Hoxc-8 and Otxl, with 
this new marker will make it possible to obtain new lines of mutant mice. This approach 
will permit the detection of electrical activity in a group of neural cells, and will also 
make it possible to complete the phenotype study of mutants obtained by replacing the 
ZacZgene(LeMouellice/a/. ? 1990, 1992; Acampora etal, 1996). For the Hoxc-8 
locus, the expression of the marker should be located in the ventral horns of the spinal 
chord beginning at section C7 (Le Mouellic et al, 1990). Anomalies in the somatotopic 
organization of the motor neurons innervating these muscles have been brought to light 
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(Tiret et ah, 1998), and a study of the role of the flux of calcium in the establishment of 
these neural connections during development may thus be undertaken. In the Otxl 
model, the transgene should be expressed in specific regions of the forebrain, given that 
an expression localized at layers V and VI of the cerebral cortex, and in regions of the 
diencephalon, mesencephalon, and cerebellum have been shown (Frantz et al, 1994). 
Mutant mice obtained by the replacement of the gene by the LacZ gene show a 
reduction in the thickness of the cerebral cortex and anomalies in the hippocampus, 
mesencephalon, and cerebellum (Acampora et al 9 1996). The loss of balance and 
rotatory movement observed in these mice can presumably be attributed to anomalies in 
the sensory organs, specifically in the eye and inner ear. These mice are also subject to 
generalized epileptic seizures. The establishment of faulty connections and/or abnormal 
electrical activity could be implicated in the genesis of these pathological processes 
(McNamara, 1992). The use of this new marker will, on the one hand, make it possible 
to verify these hypotheses through a functional and dynamic approach, and on the other, 
to address the development of epilepsy in the adult as well as during development. 
Application 2: Study of the role of intracellular calcium 

[072] Calcium is involved in a large number of cellular mechanisms, such as 
cellular migration, membrane excitability, mitochondrial metabolism, secretion, mitosis, 
and synaptic plasticity (Berridge et al, 1998). Coding calcium information at the 
cellular and subcellular level is complex, involving spatial, temporal and quantitative 
factors. Targeting marker of the invention to different subcellular compartments is 
possible by fusion with a peptide signal, for example, synoptotagmine. 

[073] Example A: Targeting the nuclear compartment will make it possible to 
study the role of calcium in transcription activation mechanisms and during the 
mechanisms related to programmed cell death (apoptosis). 

[074] Example B: Targeting two fusion proteins with GFP produces different 
emission spectra in the two cell compartments, for example, cytoplasm and the 
endoplasmic reticulum will make it possible to study the regulation of the calcium flux 
during cell activations. 

[075] Example C: Targeting the fusion protein in the synapses will make it 
possible to study the calcium activity linked to electrical activity in neural cells during 
the release of neurotransmitters. The first possibility is the achievement of a triple 
fusion between a synaptic protein, such as synaptotagmine or SNAP25, GFP, and 
aequorin. The existence of protein-protein interactions during exocytosis makes it 
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possible to consider a second possibility: A functional coupling between GFP and 
aequorin, the one in fusion with a vesicular protein and the other with a plasma protein. 
A signal will be obtained only during the interaction of the different proteins in the 
presence of an increase in the calcium ion concentration. 
Application 3: Study of calcium signals at the cell population level 

[076] Triple fusing of a protein having intercellular transport properties such as 
fragment C of the tetanus toxin (TTC) or the VP22 protein of the herpes virus with GFP 
and aequorin will make it possible to observe the calcium activity in a population of 
connected cells, for example in a neural network. 

Description of the construction of a bioluminescent marker expression vector 
sensitive to calcium ions 

Stage 1: pEGFP-CldKS (Kvnl-Smal Deletion) 

[077] Double digestion of pEGFP-Cl plasmid (Clontech, see figure) with Kpnl 

and Smal enzymes. After blunt ending the Kpnl extension with "Mung bean" nuclease, 

the two extremities are ligated. 

5' GTC GAC GGT ACC GCG GGC CCG GGA TCC 3* 
3' CAG CTG CCA TGG CGC CC G GGC CC T AGG 5' 

Kpnl Smal 

GTC GAC GGT AC G GGA TCC 

CAG CTG C C CCT AGG 

GTC GAC G G GGA TCC 

CAG CTG C C CCT AGG 

u 

GTC GAC GGG GAT CC 
CAG CTG CC C CTA GG 
Sail BamHl 
Stage 2: pEGFP-CImut (GFP mutagenesis) 

[078] Four mutagenesis oligonucleotides were used on a single-strand molecule 
prepared using pEGFP-CldKS. Each oligonucleotide comprises one or several 
mismatches (identified below in lower case letters), causing the desired mutation. In the 
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pEGFP-Clmut plasmid chosen, cut with the Sacll enzyme but not the Agel enzyme, all 
of the mutations were verified by sequencing. 

- Destruction of the Agel site, introduction of a SacU site and deletion of a Valine codon 
normally absent in "wild-type" GFP (Prasher, D.C., Eckenrode, R.K., Ward, W.W., 
Prendergast, F.G., and Cormier, M.J., Primary structure of the Aequorea victoria green- 
fluorescent protein. Gene 111 (1992)229-233.) 





SacIX 


Met 




Ser 


Lys 


Gly 


Asp 


oGMl : 


5- 1 GCGCTACCGcggGCCACC 


ATG 




AGC 


AAG 


GGC 


GAG 3' 


pEGFP-CldKS: 


5 ' GCGCTACCGGTCGCCACC 


ATG 


GTG 


AGC 


AAG 


GGC 


GAG 3' 



Agel Val 



- Replacement of the 163 Valine codon by an Alanine codon in order to increase the 
quantity of GFP assuming a correct conformation at 37°C (Siemering, K.R., Golbik, R., 
Sever, R., and Haseloff, J., Mutations that suppress the thermosensitivity-of green 
fluorescent protein. Current Biol. 6 (1996) 1653-1663.) 









lie 


Lys 


Ala 


Asn 


Phe 


Lys 




oGM2 : 


5' 


GC 


ATC 


AAG 


Gcc 


AAC 


TTC 


AAG 


3' 


pEGFP-CldKS 


5 T 


GC 


ATC 


AAG 


GTG 
Val 


AAG 


TTC 


AAG 


3 ' 



- Replacement a 23 1 Leu codon by a Histidine codon normally present in "wild-type" 
GFP (Prasher, D.C, Eckenrode, V.K., Ward, W.W., Prendergast, F.G., and Cormier, 
M. J., Primary structure of the Aequorea victoria green-fluorescent protein. Gene 1 1 1 
(1992) 229-233.) 



lie Thr His Asn Met 
°GM3 : 5 1 GG ACT ATC CaC GGC ATG GA 3 T 

pEGFP-CldKS : 5' GG ACT ATC CTC GGC ATC GA 3 T 

Leu 

Stage 3: pEGFPmut-Aeq (GFP-Aeguorin fusion protein) 

[079] Four PCRs (Polymerase Chain Reaction) done on a vector comprising the 
aequorin (Aeq) coding phase makes it possible to amplify the A, B, C, and D fragments 
with, respectively, the primers oAE5A and oAE3A, oAE5B and oAE3B, oAE5C and 
oAE3C, oAE5D and oAE3D. The overlapping regions are used to assemble the 
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different parts during successive PCRs (Ho, S.N., Hunt, H.D., Horton, R.M., Pullen, 
J.K., and Pease, L.R. Site-directed mutagenesis by overlap extension using the 
polymerase chain reaction Gene 77 (1989) 51-59.) An A+B fragment is amplified 
starting with a mixture of A and B fragments, and the primers oAE5A and oAE3B. 
Similarly, a OD fragment is amplified with a mixture of C and D fragments, using the 
primers oAESA and oAE3D. Finally, the complete coding phase, A+B+C+D is 
developed with the primers oAE5A and oAE3D. 

• Each oligonucleotide comprises one or several mismatches that 

are identified below in lower case. The "wild" sequence is represented 
opposite, in upper case. The primer oAE5A suppresses the original 
initiation translation code (ATG) and introduces a BgJIL site. The primer 
oAE3D introduces an^ol site just behind the translation terminal codon 
(TAA). The final PCR product, digested with the Bglll and XIiol 
enzymes, is cloned in the BglTI-Sall sites of the pEGFP-Clmut plasmid 
in such a way that the Valine codon (GTC), the first codon of aequorin, is 
in the same reading phase as the GFP (see figure). The other primers 
introduce "silent" mutations that do not change the protein sequence but 
modify six codons in the jellyfish, Aequoria victoria, to improve their 
expression in mammals (Wada, K-N., Aota, S.-L, Tsuchiya, R., Ishibashi, 
F., Gojobori, T., and Ikemura, T. Codon usage tabulated from the 
GenBank genetic sequence data.- Nucleic Acids Res. 18 suppl. (1990) 
2367-241 1.). The completeness of the entire sequence was verified by 
sequencing, 



OAE5A CCATG 

AGCTTCAgatct GTC AAA CTT ACA TCA GAC TTC GAC AAC CCA AGA TGG ATT GGA CGA 
TCGAAGTctaca CAG TTT GAA TGT AGT CTG AAG CTG TTG GGT TCT ACC TAA CCT GCT 
BG1II 



CAC AAG CAT ATG TTC AAT TTC CTT GAT GTC 
GTG TTC GTA TAC AAG TTA AAG GAA CTA CAG 
ATG GTC TAC AAG GCA TCT GAT ATT GTC ATC 
TAC CAG ATG TTC CGT AGA CTA TAA CAG TAG 



AAC CAC AAT GGA AAA ATC TCT CTT GAC GAG 
TTG GTG TTA CCT TTT TAG AGA GAA CTG CTC 
AAT AAC CTT GGA GCA ACA CCT GAG CAA GCC 
TTA TTG GAA CCT CGT TGT GGA CTC GTT CGG 
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OAE5B A 

AAA CGA CAC AAA GAT GCT GTg GAA GCC TTC TTC GGA GGA GCT GGA ATG AAA TAT GGT GTG 

TTT GCT GTG TTT CTA CGA CAc CTT CGG AA G AAG CCT CCT CGA CCT TAG TTT ATA CCA CAC 

T oAE3A 

GAA ACT GAT TGG CCT GCA TAT ATT GAA GGA TGG AAA AAA TTG GCT ACT GAT GAA TTG GAG 

CTT TGA CTA ACC GGA CGT ATA TAA CTT CCT ACC TTT TTT AAC CGA TGA CTA CTT AAC CTC 

OAE5C G T A 

AAA TAC GCC AAA AAC GAA CCA ACc CTC ATC CGc ATc TGG GGT GAT GCT TTG TTT GAT ATC 

TTT ATG CGG TTT TTG CTT GGT TGg GAG TAG GCg TAg ACC CCA CTA CGA AAC AAA CTA TAG 

C AT OAE3B 

GTT GAC AAA GAT CAA AAT GGA GCT ATT ACA CTG GAT GAA TGG AAA GCA TAC ACC AAA GCT 

CAA CTG TTT CTA GTT TTA CCT CGA TAA TGT GAC CTA CTT ACC TTT CGT ATG TGG TTT CGA 



GCT GGT ATC ATC CAA TCA TCA GAA GAT TGC 

CGA CCA TAG TAG GTT AGT AGT CTT CTA ACG 

GAA AGT GGA CAA CTC GAT GTT GAT GAG ATG 

CTT TCA CCT GTT GAG CTA CAA CTA CTC TAC 



ATG GAT CCT GCT TGC GAA AAG CTC TAC GGT 
TAC CTA GGA CGA ACG CTT TTC GAG ATG CCA 



GAG GAA ACA TTC AGA GTG TGC GAT ATT GAT 
CTC CTT TGT AAG TCT CAC ACG CTA TAA CTA 

OAE5D A T A 

ACA AGA CAg CAT cTg GGA TTT TGG TAC ACC 
TGT TCT GTc GTA gAc CCT AAA ACC ATG TGG 
T AT OAE3C 

Xhol 

GGA GCT GTC CCC TAA TCTcGAGGATCTTT 3 T 
CCT CGA CAG GGG ATT AGAgC T CCTAGAAA 5 1 

T oAE3D 



Stage 4: pGCA (Insertion of an Intercalated sequence) 

[080] In the pEGFPmur-Aeq plasmid, a sequence of five amino acids exists 
between the coding phases of the GFP and aequorin. Observations led to the 
lengthening of this region by intercalating a sequence in the BspEL site. Two 
complementary oligonucleotides coding for a sequence of nine amino acids give the 
composition a good deal of flexibility, owing to the abundance of Glycine and Serine. 
After insertion, the BspEl site is preserved on only one side although new intercalated 
sequences may be added successively. At each stage, the orientation is controlled by the 
BspEl enzyme. Two copies of this sequence are needed to restore the normal 
fluorescence of GFP, but the energy transfer between aequorin and GFP is optimal with 
five copies. The entire intercalated sequence of pGCA plasmid (5x9aa + the five 
initial amino acids = 50 aa) was verified by sequencing: 
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Lys 


Ser 


Gly 


Leu 


Arg 


Ser 


Val 






5' 


AAG 


TCC 


GGA 


CTC 


AGA 


TCT 


GTC 


3 1 






TTC 


AGG 


CCT 


GAG 


TCT 


AGA 


CAG 


5 ! 






GFP 


BspEI 






BG1II 


Aeq 












U 












5 T 


AAG 


T 


GC 


GGA 


CTC 


AGA 


TCT 


GTC 


3 1 


3 1 


TTC 


Ann 




T 

+ 


cue 


TCT 


AGA 


LAb 


D 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 




5 T CC GGC 


GGG 


AGC 


GGA 


TCC 


GGC 


GGC 


CAG 


T 


3 T 


3 1 G 


CCC 


TCG 


CCT 


AGG 


CCG 


CCG 


GTC 


AGG 


CC 



BairiHI BspEI 

[081] Optimization of the energy transfer by inserting a spacer between GFP 
and Apoaequorin. 

[082] A non-radiative energy transfer between the excited oxyluciferin and the 
GFP chromophore will be strongly dependent upon their overall geometry and their 
respective motions. Therefore, a linker was designed principally composed of serine 
and glycine residues to intercalate a flexible element of variable length. 

[083] The ratio of green and blue photons emitted upon Ca 4 "* triggering has 
been measured on cellular extracts prepared 48h after transient transfection of Neuro2A 
cells. The photons emitted through a beam-splitter were counted after passing 
appropriate filters. Covalent linking of GFP to aequorin (GA) significantly modified the 
wavelength of maximum light emission (Fig. 2), thereby demonstrating intramoiecular 
energy transfer. The ratio of green over blue light (500/450nm) was further raised from 
3 to around 7 by adding 1 to 5 linkers (Fig. 2, CRET). Preliminary measurement 
indicates that this ratio can reach almost 1 1 with SG5A probably because of the 
accumulation of the fusion protein anchored to the membranes (see materials and 
methods). 

[084] Spectral emissions of the different constructs were also analyzed using a 
monochromator. Aequorin showed a broad spectrum with maximum wavelength at 474 
± 6.9 nm and a bandwidth, corresponding to the distance between low and high 
wavelengths at 50% values of the maximum emission, at 108.3 + 20.1 nm (Fig. 2). 
There was a clear shift toward the green in the peak emission of the GFP-aequorin 
constructions ranging from 506.7 + 1.2 nm to 514. 1 + 3.4 nm. Increasing the length of 
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the linker further affected the sharpness of the spectrum, as indicated by the narrower 
bandwidths, 88.4 + 9.4 nm and 56.0 + 3.3 nm 5 for pGA and pG5A respectively. There 
was no evidence of a bimodal spectrum with any of the pGlA-pG5A constructs 
indicating an optimal transfer which could be incomplete in the case of pGA. 

[085] When the spacer between GFP and aequorin is longer than 14 amino 
acids, the donor and the acceptor dipoles have probably more freedom to be in a 
configuration favourable for optimum intramolecular energy transfer. The system of the 
invention yields an efficiency comparable to the intermolecular CRET measured in vivo 
(22, 23) and provides a convenient modei for the biophysical studies of radiationless 
energy transfer mechanisms. 

[086] Cellular localization and targeting of GFP-Apoaequorin. 

[087] The cellular localization of the GFP-apoaequorin constructs has been 
examined. Figure 3 illustrates GFP activity 48h after transient transfection in Neuro2A 
cells. Expression of the mutant GFP alone (Gm) showed homogenous fluorescence in 
the cytosol as well as in the nucleus as expected since GFP is a small protein that can 
diffuse into the nucleus. Mutation VI 63 A improves remarkably the fluorescence signal 
and reduces photobleaching when compared to the original EGFP (data not shown) 
probably owing to a higher concentration of properly folded protein. An evenly 
distribution is also observed for all the GFP-apoaequorin constructions in Neuro2A cells 
(Fig. 3A-D) as well as in COS-7 cells. Bright spots often appeared in the cytosol with 
fusion proteins having the shortest linkers: GA, G1A and G2A. These spots were less 
frequent with G4A and never observed with Gm and G5 A. High concentrations of 
proteins expressed during transient transfections could induce the aggregation of GFP 
(24), which is also going to be influenced by the presence of the aequorin protein and 
the distance separating them. 

[088] The GFP-apoaequorin has also been targeted to the neurotransmitter 
vesicles with a complete or a partial synaptotagmin I molecule. Synaptotagmin I is a 
transmembrane protein of synaptic vesicles and is implicated in neurotransmitter 
exocytosis (25). For imaging calcium microdomains in presynaptic compartments, the 
signal should be more accurate than if evenly distributed in the cytoplasm of neurons. In 
a three part fusion protein, SG5A (Fig. 1), the complete coding sequence of 
synaptotagmin I has been put in frame upstream of G5A. In this case, GFP fluorescence 
is superimposable with synaptotagmin immunostaining but is also visible at the cellular 
surface (Fig. 3E). In neurons (26) and in Neuro2A cells, synaptotagmin I is localized in 
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neuronal processes, but is undetectable in plasma membranes, probably because the 
dynamic mechanisms of exocytosis are followed by rapid endocytosis. When GFP- 
apoaequorin is fused with only the N-terminal part of synaptotagmin including the 
transmembrane domain but lacking the cytoplasmic domain (tSG5A, Fig. 1), a strong 
fluorescence is restricted to the cytosol (Fig. 3F). The punctate labeling suggests that 
this protein is locked into the trans-golgi system. The correct targeting of the three part 
fusion molecule of the invention does not occur with tSG5A and appears to be slowed 
down in the case of SG5A. When fused to the complete synaptotagmin protein, the 
bioluminescent marker is held back in the plasma membrane, but nevertheless labels all 
neurite outgrowths present in Neuro2A cells. 
[089] Ca"^ detection in single cells. 

[090] Neuro2A cells were transiently transfected with pA, pGA, pG2A, pG5A 
or cotransfected with pA and pGm (Fig. 1). After aequorin reconstitution with native 
coelenterazine in Ca 4 ^ - free buffer, an emission of photons has been measured with a 
classical intensified CCD camera upon the addition of CaC12 solution (5 mM) (Fig.4A. 1 
and 4A.4). With the negligible background (Fig. 4A.2), integration time of 1 second is 
enough to record the signal in single cells (Fig.4A. 1) expressing any of the fusion 
proteins. No signal could be visualized with aequorin alone or with co-expressed free 
GFP (data not shown). The presence of unbound GFP does not improve aequorin 
chemiluminescence as we observed in vitro. Because of the low level of light produced, 
aequorin expressed in situ has never been detected in single cells except when targeted 
in mitochondria. With a cooled intensified CCD camera, Rutter et al. (1996) (27) have 
succeeded in detecting intramitochondrial Ca"^ signals when aequorin is fused to 
cytochrome c oxidate. Transgenes encoding cytoplasmic aequorin can report calcium 
activities in monolayers of cells only when photomultipliers (PMT) are used, which are 
more sensitive but lack the spatial resolution for single cell analysis. The stability of 
GFP-aequorin fusions of the invention and the improved light emission have made it 
possible to detect physiological Ca" 1 ^ signals at the level of single cells. 

[09 1 ] Calcium deficiency prior to measurements or the transfection conditions 
used may induce cellular depolarization, such that opening of the voltage dependent 
Ca** channels is likely to be responsible for the fast bioluminescent response to CaCl 2 , 
addition (Fig. 4A). Light emission would then return to background level because of the 
desensitization of Ca""" channels and the membrane depolarization by Ca^-dependent K' 
channels (28). Fluo-3 showed a similar profile in mock transfections of Neuro2A cells 
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(Fig. 4C). Subsequent addition of a Ca^ ionophore (A23 1 87) induced a second 
emission of photons with comparable intensity but with different kinetics. A lower light 
intensity is detectable in Neuro2A cells transfected with pSG5A (Fig. 4B). When a 
fluorescent calcium probe is anchored to the inner surface of the membrane, the 
response kinetics are much quicker than when the probe is not targeted (29). The use of 
the bioluminescent reporter SG5A probably requires a system with higher spatial and 
temporal resolutions. In any case, the responses observed are not due to the complete 
consumption of aequorin as more bioluminescence can still be observed when a 
concentrated Ca* 4 " solution (lOOmM) is applied to cells (see Fig. 4B for example). For 
each construction, measurements have been repeated at least 4 times. A variability of 
individual cells responses was observed, probably due to cell population heterogeneity. 
Further investigations are required to calibrate relative light unit (RLU) versus Ca" 1 ^ 
concentrations. Patch-clamp techniques will also allow the identification of the type of 
calcium channels implicated in these responses and the effect of cellular transfection on 
membrane potential. 

[092] The transgenes of the invention should permit imaging of electrical 
activity in neural networks in whole animals. In vitro, two approaches were used until 
recently. The first method is based on the coupling of exocytosis to emission of light 
from synaptolucins in nerve cells (30). Light emission occurs when the luciferase, 
targeted inside the synaptic vesicles, reacts with ATP in the extracellular space. With 
this system, the authors obtain signals correlated with the neurotransmitter release but 
the low light level requires very long acquisition times (over 30 sec). In the second 
approach, fluorescence Ca 4 ^ sensitive markers have been used for measurements of 
intracellular [Ca* 4 ] by FRET (3, 4, 31). For single cell detection, this technique requires 
a sufficient concentration of probe to discriminate the signal from the background which 
is generated by autofluorescence of biological compounds and the possibility of 
calcium-independent energy transfer between the two GFPs. The integration times are 
also relatively long, between 4 and 20 seconds. 

[093] This invention thus provides new bifunctional hybrids in which 
expression patterns can be followed by GFP fluorescence while the aequorin moiety is 
the reporter of Ca"^ activity. Furthermore, the functional coupling of the two 
components, which follows the CRET principle, results in a higher amount of light 
emission and a greater Ca^" sensitivity. Bioluminescent activities of these genetic 
markers have been assessed in single cells with a cooled intensified CCD camera in 1 
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second integration times. The recent development of low level light detection systems 
should allow detection of CRET signals with much shorter integration times and higher 
spatial resolution. Intracellular and intercellular Ca"^ signaling can be approached in 
vivo in transgenic animals in which the GFP-aequorin is targeted to a particular cell 
population and/or to specific subcellular compartments. Particularly, calcium 
oscillations can then be imaged simultaneously in cells of an integrated neural circuitry 
in real time. 

[094] This invention will be described in greater detail in the following 
Examples. 

EXAMPLE 1 
Construction of GFP-aequorin fusion proteins 

[095] All the constructs were made in the pEGFP-Cl vector (Clontech). The 
EGFP gene is codon-optimized for maximal expression in mammalian cells. It also 
contains 2 mutations in the chromophore, F64L and S65T, which modify the excitation 
spectra and enhance fluorescence intensity (17). Valine 163 of the EGFP was also 
substituted by alanine, using single strand mutagenesis, to improve the proper folding of 
the protein and increase the fluorescence at 371 C (1 8, 19). The aequorin coding 
sequence, a generous gift by M.-T. Nicolas, has been fused in frame at the 3' end of the 
EGFP gene in the BgEI/Sall sites of pEGFP-Cl. Seven codons were modified for a 
better expression in mammalian cells by means of site-directed mutagenesis using PCR 
(polymerase chain reaction) with overlap extension. Then, complementary 
oligonucleotides, 5 -CCGGCGGGAGCGGATCCGGCGGCC AGT-3 1 [SEQ ID NO: 23] 
and 5 , -CCGGACTGGCCGCCGGATCCGCTCCCG-3 , [SEQ ID NO: 24] were inserted 
at the BspEl site in the 15 bp sequence between GFP and aequorin. Conservation of the 
BspEl site at only one end allowed sequential addition of one to five linker sequences 
(pGlA-pG5A). 

[096] Two additional fusion constructs were made in pG5 A with a synaptic 
protein, synaptotagmin I of which the cDNA plasmid was generously gift by M. Fukuda. 
Sequences encoding for either the entire open reading frame or the first 134 N-terminal 
amino acids, comprising the transmembrane domain of the protein, were fused in frame 
at the 5' end of the GFP-aequorin gene. 
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EXAMPLE 2 
Cell Culture and transfection 

[097] Neuroblastoma cells (Neuro2A, mouse) were grown in Dulbecco's Eagle 
medium (Life Technologies - Gibco, UK) supplement with 10% (V/V) heat-treated 
foetal calf serum, 2mm glutamine (Life Technologies - Gibco, UK) and 100 units 
streptomycin-penicillin (Life Technologies - Gibco, UK). The culture were incubated at 
37°C in a humidified atmosphere containing 8% C02 and transiently transfected using 
either the CaP0 4 , technique or the FuGENE 6™ transfection reagent (Roche). 

EXAMPLE 3 

In vitro Ca++ sensitive chemiluminescence and CRET activities 

[098] Cells were harvested 48h after transfection in 250ul of lOmM 0- 
mercaptoethanol, 4mM EDTA, 5//M coelenterazine in PBS at 4°C during 2 to 4 hours. 
Cells were rinsed in ImM EDTA in PBS and harvested in 400//1 of hypo-osmotic buffer 
(20mM Tris-HCl pH 7.5/ 5mM EDTA/ 5mM p-mercaptoethanol with a protease 
inhibitor cocktail according to the manufacturer, Roche), for 30min. to lh. at 4°C. The 
cell membranes were broken by passing through a 30 gauge needle and the cellular 
extract was obtained after microcentrifugation at 13000 rpm for lh at 40C. The 
supernatant was harvested for all constructions but SGSA for which the membrane pellet 
was further resuspended. Calcium sensitivity chemiluminescent activity was measured 
in a luminometer (Lumat LB95501 E&EG Berthold). Aliquots (10ul) were placed in 
sample tube (with 90ul of lOmM Tris-HCl pH 7.5) in the luminometer and the light 
intensity expressed in relative light unit (R.L.U.) was measured after the injection of 
100|a.l of 50mM CaCl 2 /10mM Tris-HCl pH 7.5 solution. 

[099] For CRET measurements, aliquots of extracts from transfected cells were 
placed in a reservoir chamber and brought into contact with an optic fibre bundle 
attached to a photon counting camera (Photek three-microchannel plate intensified CCD 
camera: Photek 216). Before capture of signals, light passes through a monochromator 
allowing the spectral analysis of emitted photons. The acquisition begins 20 seconds 
before injection of CaCl 2 and carries on during 40 seconds after injection of the CaCl 2 
solution (50mM). For green/blue photons ratio determinations, the same procedure was 
followed but in this case the system measures the light emitted through blue (450nm) 
and green (500nm) filters after a beam splitter. 
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EXAMPLE 4 
GFP fluorescence and immunolocalization 

[0100] Neuro2A cells were fixed 48h after transfection in 4% paraformaldehyde 
in PBS pH 7.4, rinsed in PBS, and mounted. GFP fluorescence is visualized under a 
confocal Laser Scanning microscope (Zeiss, Heidelberg, Germany) which uses an 
argon-krypton laser operating in multi-line mode or an Axiophot microscope with an 
epiluminescent system (Zeiss, Heidelberg, Germany). For immunolocalisation of the 
targeted GFP-aequorin, fixed cells were pre-treated with 50mM NH4CI in PBS pH 7.4 
for 5 min. at room temperature, permeabilised in 2% BSA/ 0.02% Triton/goat serum 
solution in PBS during lh. Antibodies against synaptotagmin (StressGen SYA-130) 
were then applied during 2-4 hrs. Cells were then rinsed in PBS and incubated in 2% 
BSA/ 0.02% Triton in PBS with secondary antibody diluted at 1/100 (TRITC conjugated 
antibody). Cells were then washed in PBS and mounted. 

EXAMPLE 5 
Single cells bioluminescence detection 
[0101] Forty-eight hours after transfection, cells were rinsed in 124mM NaCl/ 
5mMKCl/ 15mM HepespH7.4/ 5mMNaHC0 3 / lmMNaH 2 P0 4 / 0.5mM MgSQ 4 / 1. 
5mM CaCl 2 / 5.5 mM Glucose and later incubated in the same buffer without CaCl 2 
with 5pM coelenterazine to reconstituted aequorin, for 2 to 4h at 37°C and then rinsed. 
Calcium signals were visualized with a modified Olympus upright microscope (BHS) 
fitted with an BH2-RFCA epifluorescence unit recorded through a plan x40 Olympus 
long working distance water-immersion lens (N.A. 0.7). GFP Fluorescence allowed to 
choose the recording area on transfected cells. The excitation lamp was shut off and the 
gain of the camera increased. Images were integrated every second with a cooled 
Photonic Science extended ISIS video camera. Each profile in Figure 4 represents the 
amount of light emitted over the area that we defined around the soma of individual cells 
using the Axon Imaging Workbench 2214 software. Intensities of fluorescence and 
CRET activity are translated in scaled pseudocolors. Controls were made with Fluo-3 
AM on mock-transfected Neuro2A cells to check the experimental conditions. 

EXAMPLE 6 
Protein stability 

[0102] The turnover times of the different cytosolic proteins were estimated on 
transient expression in COS7 cells by treatment with puromycin (50jag/ml) for 6h. 
Ca -induced chemiluminescence activities were performed on cellular extract obtained 
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after the reconstitution of aequorin in presence of SpM coelenterazine. Calcium 
sensitivity chemiluminescence activity was measured in a luminometer (Lumat 
LB95501 E&EG Berthold). Aliquots (lOfil) were placed in a sample tube (with 90|al of 
lOmM Tris-HCl, pH 7.5) in the luminometer and the light intensity expressed, in relative 
light units (RLUs), was measured after the injection of lOOpl of 50mM CaCl 2 /10mM 
Tris-Hcl pH 7.5 solution. Relative chemiluminescence activities are expressed as a 
percentage of the activity at the time zero (100%). The results are shown in Fig. 5. As 
seen in Fig. 5, over this period, most fusion proteins presented 30% decrease of activity 
compared with the 80% loss of apoaequorin when alone. 

EXAMPLE 7 

Determination of the Ca++ affinity of aequorin and G5A 

[0103] Ca 2+ ~ induced chemiluminescence activities were performed on cellular 
extract obtained after the reconstitution of aequorin in presence of 5jxM coelenterazine. 
Calcium sensitivity chemiluminescence activity was measured in a luminometer (Lumat 
LP95501 E&EG Berthold). Aliquots (lOjil) were placed in a sample tube (with 9G|nl of 
lOmM Tric-HCl, pH 7.5) in the luminometer and the light intensity expressed, in 
relative light units (RLUs), was measured after the injection of 100]td of different 
Ca/EGTA solutions. The results are shown in Fig. 6. As seen in Fig. 6, G5A gives a 
significant signal over background with Ca 2+ concentrations as low as 38 nM, whereas 
aequorin needs 28 times more calcium (1 M) to yield a comparable signal. 

For Chimeric GFP-aequorin as bioluminescent Ca 2+ reporters at the single cell level 

Concerning the invention of chimeric GFP-aequorin calcium sensitive 
bioluminescent reporters, new applications have been developed and some preliminary 
datas have been obtained about sensitivity of GFP-aequorin proteins to Ca 2+ ions. 

EXAMPLE 8 

Ca sensitivity of G5A and SG5A: Calibration curves between bioluminescent 

signals and Ca 2+ concentrations 

Measurements of Ca 2+ sensitivity of two constructs G5A and SG5A were 
performed on cellular extracts obtained after the reconstitution of aequorin in presence 
of 5jxM colenterazine. Calcium chemiluminescence activity was measured in a 
luminometer (Lumat LB95501 E&EG Berthold). Aliquots (lOpl) were placed in a 
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sample tube with 90 ]^1 of lOmM Tris.HCl pH 7.5 in the luminometer and the light 
intensity expressed, in relative light units (RLUs), was measured after the injection of 
100ml of different Ca/EGTA solutions (Molecular Probes Calcium Calibration Buffer 
Kit). Figure 7 shows the Ca 2+ response curve of G5A, SG5A and aequorin. The curves 
represent the relationship between the ratio L/Lmax and [Ca2+]. L is the rate of RLUs 
at any given [Ca2+] and Lmax is the rate of RLUs at saturating [Ca2+]. These results 
show a much higher affinity for Ca 2+ of the various forms of GFP-aequorin than 
aequorin. 

EXAMPLE 9 
New applications of GFP-aequorin reporters 

Adenoviral vectors with GFP-aequorin were developed. Using these new 
constructs, dissociated neurons from rat spinal cord in culture can be transfected with 
higher efficiency. Figures 8 and 9 depict Ca 2+ -induced bioluminescent signals detected 
at the single cell level in dissociated neuronal cells. Neuronal cells infected by 
adenoviral vectors with G5A (Fig. 8) or SG5A (Fig. 9) were pre-incubated with 5jjM 
coelenterazine in a Ca 2+ -free buffer. Intensities of fluorescence and bioluminescence 
activity are translated in pseudocolors. Representative pictures of the chosen fields are 
shown after the addition of 5mM and 2.5mM of CaCl 2? respectively, for Figures 8a-c & 
9a at 12 and 9 seconds. Figures 8d-e and 9b were obtained after addition of ionomycin 
and high concentration of CaCl 2 (lOOmM). 

EXAMPLE 10 

Expression of GFP-aequorin reporters in vivo in Xenopus embryos and 
measurement of calcium activities 

Calcium signalling during early and late embryogenesis in Xenopus was studied. 
Figure 10 shows representative pattern of luminescence activity illustrating the changes 
in intracellular calcium during the neural induction after the injection of the GA plasmid 
at the one cell stage in Xenopus embryo. Figure 1 1 shows a transgenic Xenopus larva 
with GFP-aequorin. These techniques can also be employed with zebrafish and mouse 
transgenics. These results show that these calcium reporters can be used in a great 
variety of organisms or tissues to visualize calcium activity and to measure calcium 
concentrations. 

[0104] In summary, the new linker useful for energy transfer by CRET system in 
a bioluminescent system has the following properties: 
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Forms: 

[0105] Different amino acid sequences and peptide sequences of the linker are 
described. Its length comprises a minimal size of 4 to 9 amino acids, which can be 
extended by a group of 7 to 12 amino acids (in a preferred embodiment 9 amino acids). 
The said group is extendable to 63 amino acids, i.e., 9 x 6 times. The experiment was 
done, for example, with a peptide linker comprising 5 amino acids followed by 1 to 5 
times of 9 amino acids. 
Functions: 

[0106] Its first function is to approach donor sites and acceptor sites of two 
molecules for a direct transmission of energy. This linker confers an optimal 
environment for energy transmission by CRET. 

[0107] The second function is the stabilization of the described system by 
increasing the half life of aequorin because of the fusion of GFP. The aequorin is linked 
to the GFP, which has a half life of more than 24 hours. 
Applications: 

[0108] In a bioluminescent system, aptitude for protein-protein interaction. 

[0109] Application of the bioluminescent system with the linker: 
epileptogenesis, SNC disease (visualization of the neuronal cell activities during 
development and in the adult), neuromuscular connection with the implication of 
homeogene HOX-C8 in the spinal cord. 

[0110] Application in apoptosis with a chimeric protein comprising the linker 
according to the invention by the visualization of the modifications of the intracellular 
calcium pools. 

[0111] Visualization and precision of the role of calcium waves in living organs 
like the spleen (intra and intercellular calcium waves). 
Results: 

[0112] Chimeric protein is more stable by augmentation of the half-life of the 
molecule. Augmentation of the sensitivity for calcium ions is important. 

[0113] The linker of the invention has surprising properties. The sensitivity of 
calcium ions of the chimeric molecule containing the aequorin and the linker is different 
from that for aequorin alone. The invention provides a better sensitivity. 

[0114] This linker makes it possible to attach together an aequorin molecule with 
a GFP. The following reference demonstrates that the both molecules do not interact 
together without a linker: Morise, H. Shimomura, O., Johonson, F.H. and Winant, J. 
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(1974) Intermolecular Energy Transfer in the bioluminescent system of Aequoria. 
Biochemistry 13, 2656-2662. 

[0115] It is the first time that one can obtain visualization of aequorin signal in a 
live single cell system (or in an alive animal). 

[0116] In summary, monitoring calcium fluxes in real time could help to 
understand the development, the plasticity and the functioning of the central nervous 
system. In jellyfish, the chemiluminescent calcium binding aequorin protein is 
associated with the green fluorescent protein (GFP) and a green bioluminescent signal is 
emitted upon Ca^ stimulation. We decided to use this Chemiluminescence Resonance 
Energy Transfer (CRET) between the two molecules. Calcium sensitive bioluminescent 
reporter genes have been constructed by fusing GFP and aequorin resulting in much 
more light being emitted. Chemiluminescent and fluorescent activities of these fusion 
proteins have been assessed in mammalian cells. Cystosolic Ca~^ increases were imaged 
at the single cell level with a cooled intensified CCD camera. This bifunctional reporter 
gene should allow the investigation of calcium activities in neuronal networks and in 
specific subcellular compartments in transgenic animals. 

[0117] Following are sequences and the corresponding sequence identifiers 
referred to herein: 
Peptide sequences: 
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age 


cgc 


tac 


ccc 


gac 


tec 


gee 


atg 


ccc 


gaa 


ggc 


tac 


gtc 


cag 


gag 


aac 


tac 


aag 


acc 


cgc 


gee 


gag 


gtg 


aag 


ttc 


gag 


ctg 


aag 


ggc 


ate 


gac 


ttc 


aag 


gag 


gac 


tac 


aac 


tac 


aac 


age 


cac 


aac 


gtc 


tat 


ate 


aag 


gCC 


aac 


ttc 


aag 


ate 


cqc 


cac 


aac 


ate 


cac 


tac 


cag 


cag 


aac 


acc 


ccc 


ate 


ggc 


gac 


tac 


ctg 


age 


acc 


cag 


tec 


gee 


ctg 


age 


aaa 


gtc 


ctg 


ctg 


gag 


ttc 


gtg 


acc 


gee 


gee 


ggg 


aag 


tec 


gga 


etc 


aGA 


TCT 


gtc 


aaa 


ctt 


aca 


gga 


cga 


cac 


aag 


cat 


atg 


ttc 


aat 


ttc 


ctt 


ctt 


gac 


gag 


atg 


gtc 


tac 


aag 


gca 


tct 


gat 


cct 


gag 


caa 


gee 


aaa 


cga 


cac 


aaa 


gat 


get 


atg 


aaa 


tat 


ggt 


gtg 


gaa 


act 


gat 


tgg 


cct 


get 


act 


gat 


gaa 


ttg 


gag 


aaa 


tac 


gee 


aaa 


ggt 


gat 


get 


ttg 


ttt 


gat 


ate 


gtt 


gac 


aaa 


gaa 


tgg 


aaa 


gca 


tac 


acc 


aaa 


get 


get 


ggt 


gaa 


aca 


ttc 


aga 


gtg 


tgc 


gat 


att 


gat 


gaa 


aca 


aga 


caG 


cat 


CtG 


gga 


ttt 


tgg 


tac 


acc 


ggt 


gga 


get 


gtc 


ccc 












ggg 


gtg 


gtg 


ccc 


ate 


ctg 


gtc 


gag 


ctg 


gac 


gtg 


tec 


ggc 


gag 


ggc 


gag 


ggc 


gat 


gec 


acc 


tgc 


acc 


acc 


ggc 


aag 


ctg 


ccc 


gtg 


ccc 


tgg 


ggc 


gtg 


cag 


tgc 


ttc 


age 


cgc 


tac 


ccc 


gac 


tec 


gec 


atg 


ccc 


gaa 


ggc 


tac 


gtc 


cag 


gag 


aac 


tac 


aag 


acc 


cgc 


gec 


gag 


gtg 


aag 


ttc 


gag 


ctg 


aag 


ggc 


ate 


gac 


ttc 


aag 


gag 


gac 


tac 


aac 


tac 


aac 


age 


cac 


aac 


gtc 


tat 


ate 


aag 


gCC 


aac 


ttc 


aag 


ate 


cgc 


cac 


aac 


ate 


cac 


tac 


cag 


cag 


aac 


acc 


ccc 


ate 


ggc 


gac 


tac 


ctg 


age 


acc 


cag 


tec 


gee 


ctg 


age 


aaa 


gtc 


ctg 


ctg 


gag 


ttc 


gtg 


acc 


gec 


gec 


ggg 


aag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 


cag 


tea 


gac 


ttc 


gac 


aac 


cca 


aga 


tgg 


att 


gga 


gat 


gtc 


aac 


cac 


aat 


gga 


aaa 


ate 


tct 


ctt 


att 


gtc 


ate 


aat 


aac 


ctt 


gga 


gca 


aca 


cct 


gtG 


gaa 


gec 


ttc 


ttc 


gga 


gga 


get 


gga 


atg 


gca 


tat 


att 


gaa 


gga 


tgg 


aaa 


aaa 


ttg 


get 


aac 


gaa 


cca 


acC 


etc 


ate 


cgC 


ata 


tgg 


ggt 


gat 


caa 


aat 


gga 


get 


att 


aca 


ctg 


gat 


gaa 


ate 


ate 


caa 


tea 


tea 


gaa 


gat 


tgc 


gag 


gaa 
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aca 


ttc 


aga 


gtg 


tgc 


gat 


att 


gat 


gaa 


agt 


gga 


caa 


etc 


gat 


gtt 


gat 


gag 


atg 


aca 


aaa 


caG 


cat 


CtG 


gga 


ttt 


tgg 


tac 


acc 


atg 


gat 


cct 


get 


tgc 


gaa 


aag 


etc 


tac 


ggt 


aaa 


get 


gtc 


ccc 


[SEQ ID 


NO: 


8] 
























G2A 






































Atg 


age 


aag 


ggc 


gag 


gag 


ctg 


ttc 


acc 


ggg 


gtg 


gtg 


ccc 


ate 


ctg 


gtc 


gag 


ctg 


gac 


aac 


crac 


gta 


aac 


ggc 


cac 


aag 


ttc 


age 


gtg 


tec 


ggc 


gag 


ggc 


gag 


ggc 


gat 


gec 


acc 


tac 


qqc 


aag 


ctg 


ace 


ctg 


aag 


ttc 


ate 


tgc 


acc 


acc 


ggc 


aag 


ctg 


ccc 


gtg 


ccc 


tgg 


ccc 


ace 


etc 


gtg 


ace 


acc 


ctg 


acc 


tac 


ggc 


gtg 


cag 


tgc 


ttc 


age 


cgc 


tac 


ccc 


gac 


cac 


atg 


aag 


cag 


cac 


gac 


ttc 


ttc 


aag 


tec 


gee 


atg 


ccc 


gaa 


ggc 


tac 


gtc 


cag 


gag 


cgc 


ace 


ate 


ttc 


ttc 


aag 


gac 


gac 


ggc 


aac 


tac 


aag 


acc 


cgc 


gee 


gag 


gtg 


aag 


ttc 


gag 


ggc 


gac 


ace 


ctg 


gtg 


aac 


cgc 


ate 


gag 


ctg 


aag 


ggc 


ate 


gac 


ttc 


aag 


gag 


gac 


ggc 


aac 


ate 


ctg 


ggg 


cac 


aag 


ctg 


gag 


tac 


aac 


tac 


aac 


age 


cac 


aac 


gtc 


tat 


ate 


atg 


gee 


gac 


aag 


cag 


aag 


aac 


ggc 


ate 


aag 


gCC 


aac 


ttc 


aag 


ate 


cgc 


cac 


aac 


ate 


gag 


gac 


ggc 


age 


gtg 


cag 


etc 


gee 


gac 


cac 


tac 


cag 


cag 


aac 


acc 


ccc 


ate 


ggc 


gac 


ggc 


ccc 


gtg 


ctg 


ctg 


ccc 


gac 


aac 


cac 


tac 


ctg 


age 


acc 


cag 


tec 


gee 


ctg 


age 


aaa 


aac 


ccc 


aac 


gag 


aag 


cgc 


gat 


cac 


atg 


gtc 


ctg 


ctg 


gag 


ttc 


gtg 


acc 


gee 


gec 


ggg 


ate 


act 


cAc 


ggc 


atg 


gac 


gag 


ctg 


tac 


aag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 


cag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 


cag 


tac 


ggc 


etc 


aGA 


TCT 


gtc 


aaa 


ctt 


aca 


tea 


aac 


ttc 


gac 


aac 


cca 


aga 


tgg 


att 


gga 


cga 


cac 


aag 


cat 


atg 


ttc 


aat 


ttc 


ctt 


gat 


gtc 


aac 


cac 


aat 


gga 


aaa 


ate 


tct 


ctt 


gac 


gag 


atg 


gtc 


tac 


aag 


gca 


tct 


gat 


att 


gtc 


ate 


aat 


aac 


ctt 


gga 


gca 


aca 


cct 


gag 


caa 


gee 


aaa 


cga 


cac 


aaa 


gat 


get 


gtG 


gaa 


gee 


ttc 


ttc 


gga 


gga 


get 


gga 


atg 


aaa 


tat 


ggt 


gtg 


gaa 


act 


gat 


tgg 


cct 


gca 


tat 


att 


gaa 


gga 


tgg 


aaa 


aaa 


ttg 


get 


act 


gat 


gaa 


ttg 


gag 


aaa 


tac 


gee 


aaa 


aac 


gaa 


cca 


acC 


etc 


ate 


cgC 


ata 


tgg 


ggt 


gat 


get 


ttg 


ttt 


gat 


ate 


gtt 


gac 


aaa 


gat 


caa 


aat 


gga 


get 


att 


aca 


ctg 


gat 


gaa 


tgg 


aaa 


gca 


tac 


acc 


aaa 


get 


get 


ggt 


ate 


ate 


caa 


tea 


tea 


gaa 


gat 


tgc 


gag 


gaa 


aca 


ttc 


aga 


gtg 


tgc 


gat 


att 


gat 


gaa 


agt 


gga 


caa 


etc 


gat 


gtt 


gat 


gag 


atg 


aca 


aga 


caG 


cat 


CtG 


gga 


ttt 


tgg 


tac 


acc 


atg 


gat 


cct 


get 


tgc 


gaa 


aag 


etc 


tac 


ggt 


gga 


get 


gtc 


ccc 


[SEQ ID 


NO: 


9] 







G4A 



Atg 


age 


aag 


ggc 


gag 


gag 


ctg 


ttc 


acc 


ggg 


gtg 


gtg 


ccc 


ate 


ctg 


gtc 


gag 


ctg 


gac 


ggc 


gac 


gta 


aac 


ggc 


cac 


aag 


ttc 


age 


gtg 


tec 


ggc 


gag 


ggc 


gag 


ggc 


gat 


gee 


acc 


tac 


ggc 


aag 


ctg 


acc 


ctg 


aag 


ttc 


ate 


tgc 


acc 


acc 


ggc 


aag 


ctg 


ccc 


gtg 


ccc 


tgg 


ccc 


acc 


etc 


gtg 


acc 


acc 


ctg 


acc 


tac 


ggc 


gtg 


cag 


tgc 


ttc 


age 


cgc 


tac 


ccc 


gac 


cac 


atg 


aag 


cag 


cac 


gac 


ttc 


ttc 


aag 


tec 


gee 


atg 


ccc 


gaa 


ggc 


tac 


gtc 


cag 


gag 


cgc 


acc 


ate 


ttc 


ttc 


aag 


gac 


gac 


ggc 


aac 


tac 


aag 


acc 


cgc 


gee 


gag 


gtg 


aag 


ttc 


gag 


ggc 


gac 


acc 


ctg 


gtg 


aac 


cgc 


ate 


gag 


ctg 


aag 


ggc 


ate 


gac 


ttc 


aag 


gag 


gac 


ggc 


aac 


ate 


ctg 


ggg 


cac 


aag 


ctg 


gag 


tac 


aac 


tac 


aac 


age 


cac 


aac 


gtc 


tat 


ate 


atg 


gee 


gac 


aag 


cag 


aag 


aac 


ggc 


ate 


aag 


gCC 


aac 


ttc 


aag 


ate 


cgc 


cac 


aac 


ate 


gag 


gac 


ggc 


age 


gtg 


cag 


etc 


gec 


gac 


cac 


tac 


cag 


cag 


aac 


acc 


ccc 


ate 


ggc 


gac 


ggc 


ccc 


gtg 


ctg 


ctg 


ccc 


gac 


aac 


cac 


tac 


ctg 


age 


acc 


cag 


tec 


gee 


ctg 


age 


aaa 


gac 


ccc 


aac 


gag 


aag 


cgc 


gat 


cac 


atg 


gtc 


ctg 


ctg 


gag 


ttc 


gtg 


acc 


gee 


gee 


ggg 


ate 


act 


cAc 


ggc 


atg 


gac 


gag 


ctg 


tac 


aag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 


cag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 


cag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 


cag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 


cag 


tec 


ggc 


etc 


aGA 


TCT 


gtc 


aaa 


ctt 


aca 


tea 


gac 


ttc 


gac 


aac 


cca 


aga 


tgg 


att 


gga 


cga 


cac 


aag 


cat 


atg 


ttc 


aat 


ttc 


ctt 


gat 


gtc 


aac 


cac 


aat 


gga 


aaa 


ate 


tct 


ctt 


gac 


gag 


atg 


gtc 


tac 


aag 


gca 


tct 


gat 


att 


gtc 


ate 


aat 


aac 


ctt 


gga 


gca 


aca 


cct 


gag 


caa 


gee 


aaa 


cga 


cac 


aaa 


gat 


get 


gtG 


gaa 


gec 


ttc 


ttc 


gga 


gga 


get 


gga 


atg 


aaa 


tat 


ggt 


gtg 


gaa 


act 


gat 


tgg 


cct 


gca 


tat 


att 


gaa 


gga 


tgg 


aaa 


aaa 


ttg 


get 


act 


gat 


gaa 


ttg 


gag 


aaa 


tac 


gec 


aaa 


aac 


gaa 


cca 


acC 


etc 


ate 


cgC 


ata 


tgg 


ggt 


gat 


get 


ttg 


ttt 


gat 


ate 


gtt 


gac 


aaa 


gat 


caa 


aat 


gga 


get 


att 


aca 


ctg 


gat 


gaa 


tgg 


aaa 


gca 


tac 


acc. 


aaa 


get 


get 


ggt 


ate 


ate 


caa 


tea 


tea 


gaa 


gat 


tgc 


gag 


gaa 


aca 


ttc 


aga 


gtg 


tgc 


gat 


att 


gat 


gaa 


agt 


gga 


caa 


etc 


gat 


gtt 


gat 


gag 


atg 


aca 


aga 


caG 


cat 


CtG 


gga 


ttt 


tgg 


tac 


acc 


atg 


gat 


cct 


get 


tgc 


gaa 


aag 


etc 


tac 


ggt 


gga 


get 


gtc 


ccc 


[SEQ ID 


no: 


10] 









G5A 

Atg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg gtc gag ctg gac 
ggc gac gta aac ggc cac aag ttc age gtg tec ggc gag ggc gag ggc gat gec acc 
tac ggc aag ctg acc ctg aag ttc ate tgc acc acc ggc aag ctg ccc gtg ccc tgg 
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ccc 


acc 


etc 


y *-y 


acc 


acc 


ctg 


acc 


tac 


CaC 


at g 


aag 


cag 


cac 


gac 


ttc 


ttc 


aag 


CCfC 


acc 


ate 


ttc 


ttc 


aag 


gac 


gac 


acre 
yy° 


a pt pf 
y ay 


yy u 


gac 


acc 


ctg 


cft a 

y i_y 


aac 


cgc 


ate 




aac 


ate 


ctg 


yyy 


cac 


aag 


ctg 


craa 
y u y 


3 1 Cf 


gec 


gac 


aag 


cag 


aag 


aac 


Cf CfC 

yy^ 


ate 


pt 

gag 


y d.u 


pf rrr 


age 


y u y 


cag 


etc 


gec 


gac 


ggc 


CCC 


gtg 




P* +~ iT 


r* r* 


ptpi p" 
y du 


Pi Pi p* 


par" 


gac 


CCC 


aac 


pf pi pf 
y ay 


aag 


cgc 


gat 


cac 


atg 


ate 


act 


cAc 


GCf O 


atg 


gac 


cracf 

y ay 


ctg 


tac 


tec 


cine* 


ctctct 


agC 


crcra 
yy a 


tec 


Pf Pf Pf. 

yy u 


crcrc 
yy u 


cag 




rrrTri 
yyy 


age 


aaa 
yy w 


tec 


aac 


CTCfC 

yy u 


caa 


tec 


etc 


aGA 


TCT 


gtc 


aaa 


ctt 


aca 


tea 


gac 


aag 


cat 


at g 


ttc 


aat 


ttc 


ctt 


gat 


gtc 


at g 


gtc 


tac 


aag 


gca 


tct 


gat 


att 


gtc 


gec 


aaa 


cga 


cac 


aaa 


gat 


get 


gtG 


gaa 


rrrft 

yy L 


pf t <t 

y L y 


gaa 


act 


gat 


t Cf CJ 
L-y y 


cc t 


gca 


tat 


gaa 


ttg 


gag 


aaa 


tac 


gee 


aaa 


aac 


aaa 
y wu 


ttg 


ttt 


gat 


ate 


gtt 


gac 


aaa 


gat 


caa 


gca 


tac 


acc 


aaa 


get 


get 


ggt 


ate 


ate 


aga 


gtg 


tgc 


gat 


att 


gat 


gaa 


aqt 


gga 


cat 


CtG 


gga 


ttt 


tgg 


tac 


acc 


atg 


gat 


gtc 


ccc 


[SEQ ID 


NO: 


11] 
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CTCfC 
\i x - 


CftCf 


cag 


tgc 


ttc 


age 


cgc 


tac 


ccc 


gac 


t cc 


gec 


atg 


CCC 


gaa 


aac 


tac 


gtc 


cag 


aaa 


aac 


tac 


aag 


acc 


cgc 


acc 


aaa 


ata 

rJ ^ 


aag 


ttc 


aaa 


ctg 


aag 


aac 


ate 


aac 


ttc 


aaa 


aaa 

3" 3 


gac 


tac 


aac 


tac 


aac 


age 


cac 


aac 


gtc 


tat 


ate 


aag 


gCC 


aac 


ttc 


aag 


ate 


cgc 


cac 


aac 


ate 


cac 


tac 


cag 


cag 


aac 


acc 


ccc 


ate 


aac 
y y ^ 


gac 


tac 


ctg 


age 


acc 


cag 


tec 


gec 


ctg 


age 


aaa 


gtc 


ctg 


ctg 


y ay 


ttc 


crt a 

y 


acc 


gec 


gec 


aaa 
yyy 


aag 


tec 


CTCfC 

yy^ 


aaa 


age 


aaa 
yy a 


tec 


aac 


CTCTC 

yy u 


cag 


tec 


CTCfC 


acre 


age 


aaa 


tec 


aac 


aac 

3 3 W 


caa 


tee 


aac 


CTCfCf 


aac 


aaa 


tec 


aac 

\S 3 w 


aac 


caa 


tec 


aac 
3 3 w 


ttc 


gac 


aac 


cca 


aga 


taa 


att 


gga 


cga 


cac 


aac 


cac 


aat 


aaa 

y y ^ 


aaa 


ate 


tct 


ctt 


gac 


aaa 


ate 


aat 


aac 


ctt 


aaa 


gca 


aca 


cct 


aaa 


caa 


gec 


ttc 


ttc 


aaa 


aaa 


get 


aaa 

rJ ^3 " 


atg 


aaa 


tat 


att 


aaa 


aaa 


taa 


aaa 


aaa 


ttg 


get 


act 


gat 


cca 


acC 


etc 


ate 


cgC 


ata 


tgg 


ggt 


gat 


get 


aat 


gga 


get 


att 


aca 


ctg 


gat 


gaa 


tgg 


aaa 


caa 


tea 


tea 


gaa 


gat 


tgc 


gag 


gaa 


aca 


ttc 


caa 


etc 


gat 


gtt 


gat 


gag 


atg 


aca 


aga 


caG 


cct 


get 


tgc 


gaa 


aag 


etc 


tac 


ggt 


gga 


get 



SeGBA 



riiy 


gtg 


agt 


gee 


agt 


p'PT'h 
uy L. 


cct 


gag 


gc c 


ctg 


get 


gec 


cct 


gt c 


acc 


act 




PTP* Pf 

gcg 


pi p'P* 


r*t t 


pet - r* 
y 


P* P* Pl 


p 1 pi p* 

L^a o 


a p p 


pf p*p* 
y 


at i— 


pt ^ pt 

y ay 


p* p*^ 


pf p'P' 

y 


ay l. 


r-* p* "h 


pf Pf pf 

yyy 


pf pi ^ 

y aa 


pf pf pf 
yyy 


p» pi pf 

aay 


Pf Pi Pi 

y aa 


PfP» "T 
y a u 


Pf P'P* 


+-+-+- 


tec 


aay 


p*+" pf 


aag 


p* p< pf 
uay 


aag 


u u u 


atg 


aat 


gaa 


ctg 


p*pi f- 
C-a L. 


aaa 


pi -hp- 
a ll 


p* p* 3 

Ota 


f- i~ n 
l. l. y 


cca 


ccg 


tgg 


gee 


+- 4- -3 


-.4-4- 

all 


gee 


ata 


gee 


a La 


grr 


gcg 


gtc 


r-+" -h 
CLL 


eta 


gtc 


gtg 


acc 


tgc 


tgc 




Ly l 


gtc 


i_y i_ 


aag 


aaa 


uy l 


t-f-pf 

l. Ly 




aaa 


aag 


aaa 


aac 


aag 


aag 


aag 


gg a 


Pi Pi pf 

day 


pf pi pi 

y aa 


pi pi pf 
aay 


gga 


9" 9*9" 


aag 


aac 


gec 


aff 

a L. U 


aac 


pii~a 


-j 

aaa 


gac 


pf t* pf 
9 ^y 


•r> o 
aaa 


gac 


f f a 

L, ua 


Pf PTPf 

yyy 


pi pi pr 

aay 


pt p*p* 


pi trr 
aiy 


p» pi pf 
aay 


gat 


cag 


gec 


ctt 


aag 


gat 


gac 


gat 


get 


gaa 


act 


gga 


ctg 


act 


gat 


gga 


gaa 


gaa 


aag 


gag 


gag 


ccc 


aag 


gaa 


gag 


gag 


aaa 


ctg 


gga 


aag 


ctt 


caa 


tat 


tea 


ctg 


gac 


tat 


gac 


ttc 


cag 


aat 


aac 


cag 


ctg 


ctg 


gtg 


gga 


ate 


ate 


cag 


get 


get 


gaa 


ctg 


ccc 


gee 


ctg 


gac 


atg 


gga 


ggc 


aca 


tct 


gat 


cca 


tac 


gtc 


aaa 


gtc 


ttc 


ctg 


ctg 


ccc 


gac 


aaa 


aag 


aag 


aag 


ttt 


gag 


aca 


aaa 


gtc 


cac 


egg 


aaa 


acc 


etc 


aat 


cca 


gtc 


ttc 


aat 


gaa 


cag 


ttt 


act 


ttc 


aag 


gtg 


cca 


tac 


teg 


gaa 


tta 


ggt 


ggc 


aag 


aca 


ctg 


gtg 


atg 


get 


gtg 


tat 


gat 


ttt 


gac 


cgc 


ttc 


tec 


aag 


cac 


gac 


ate 


att 


gga 


gag 


ttc 


aaa 


gtt 


cct 


atg 


aac 


acc 


gtg 


gat 


ttt 


ggc 


cac 


gtc 


acc 


gag 


gag 


tgg 


cgc 


gat 


etc 


cag 


agt 


get 


gag 


aaa 


gaa 


gag 


caa 


gag 


aaa 


ctg 


ggt 


gac 


ate 


tgc 


ttc 


tec 


etc 


cgc 


tac 


gtc 


cct 


act 


gee 


ggc 


aag 


ctg 


act 


gtt 


gtc 


att 


ctg 


gaa 


gec 


aag 


aac 


ctg 


aag 


aag 


atg 


gat 


gtg 


qgt 


ggc 


tta 


tct 


gat 


ccc 


tat 


gta 


aag 


att 


cac 


ctg 


atg 


cag 


aac 


ggc 


aag 


aga 


ctg 


aag 


aag 


aaa 


aag 


aca 


acg 


att 


aag 


aag 


aac 


aca 


ctt 


aac 


ccc 


tac 


tac 


aat 


gag 


tec 


ttc 


age 


ttt 


gaa 


gtt 


ccg 


ttc 


gag 


caa 


ate 


cag 


aaa 


gtg 


caa 


gtg 


gtg 


gta 


act 


gtt 


ttg 


gac 


tat 


gac 


aag 


att 


ggc 


aag 


aac 


gac 


gee 


ate 


ggc 


aaa 


gtc 


ttt 


gtg 


ggc 


tac 


aac 


age 


acc 


ggc 


gca 


gag 


ctg 


cga 


cac 


tgg 


tea 


gac 


atg 


ctg 


gee 


aac 


ccc 


egg 


cgc 


ccc 


ate 


gee 


cag 


tgg 


cac 


act 


ctg 


cag 


gta 


gag 


gag 


gag 


gtt 


gat 


gee 


atg 


ctg 


get 


gtc 


aag 


aGA 


tec 


GGG 


AAT 


TCC 


GGG 


CGG 


gec 


acc 


atg 


age 


aag 


ggc 


gag 


gag 


ctg 


ttc 


acc 


ggg 


gtg 


gtg 


ccc 


ate 


ctg 


gtc 


gag 


ctg 


gac 


ggc 


gac 


gta 


aac 


ggc 


cac 


aag 


ttc 


age 


gtg 


tec 


ggc 


gag 


ggc 


gag 


ggc 


gat 


gee 


acc 


tac 


ggc 


aag 


ctg 


acc 


ctg 


aag 


ttc 


ate 


tgc 


acc 


acc 


ggc 


aag 


ctg 


ccc 


gtg 


ccc 


tgg 


ccc 


acc 


etc 


gtg 


acc 


acc 


ctg 


acc 


tac 


ggc 


gtg 


cag 


tgc 


ttc 


age 


cgc 


tac 


ccc 


gac 


cac 


atg 


aag 


cag 


cac 


gac 


ttc 


ttc 


aag 


tec 


gee 


atg 


ccc 


gaa 


ggc 


tac 


gtc 


cag 


gag 


cgc 


acc 


ate 


ttc 


ttc 


aag 


gac 


gac 


ggc 


aac 


tac 


aag 


acc 


cgc 


gee 


gag 


gtg 


aag 


ttc 


gag 


ggc 


gac 


acc 


ctg 


gtg 


aac 


cgc 


ate 


gag 


ctg 


aag 


ggc 


ate 


gac 


ttc 


aag 


gag 


gac 


ggc 


aac 


ate 


ctg 


ggg 


cac 


aag 


ctg 


gag 


tac 


aac 


tac 


aac 


age 


cac 


aac 


gtc 


tat 


ate 


atg 


gee 


gac 


aag 


cag 


aag 


aac 


ggc 


ate 


aag 


gCC 


aac 


ttc 


aag 


ate 


cgc 


cac 


aac 


ate 


gag 


gac 


ggc 


age 


gtg 


cag 


etc 


gec 


gac 


cac 


tac 


cag 


cag 


aac 


acc 


ccc 


ate 


ggc 


gac 


ggc 


ccc 


gtg 


ctq 


ctg 


ccc 


gac 


aac 


cac 


tac 


ctg 


age 


acc 


cag 


tec 


gee 


ctg 


age 


aaa 


gac 


ccc 


aac 


gag 


aag 


cgc 


gat 


cac 


atg 


gtc 


ctg 


ctg 


gag 


ttc 


gtg 


acc 


gee 


gee 


ggg 


ate 


act 


cAc 


ggc 


atg 


gac 


gag 


ctg 


tac 


aag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 


cag 


tec 


ggc 


ggg 


age 


gga 


tec 


ggc 


ggc 
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LUL. 


ggc 


yyy 


o.y 0 


yy d 


LOO 


rrrrf 




Gcig 


tec 


yy c 


rrnn 

yyy 


ciy 0 


yy d 


tec 


yyo 


yyo 


os.g 


tec 




ggg 


35c 




tec 




yy c 


Lay 


4- 

LOO 


ggc 


O LL 




4. O J. 


gtc 


CXdCL 


r ff- 

O L L 


dOd 


tea 


gac 


LLC- 


gac 


aac 


y O d 


aga 


tgg 


a 1 t- 


y y d 


Ly a. 


LctL 


ddy 


r-af 
Oct L 


Piter 


L LO 


aaf 
clct L 


j- -f 
L LO 


r f f 

O L L 


gat 


gt C 


aac 


cac 


a =» t- 

dd L 


gga 


aa a 


u LL 


L O L 


O L L 


ga c 


q a 9 


atg 


gtc 


f ^ r 
LclO 


aag 


gca 


L O L 


gat 


att 


gt C 


ate 


^ a -f- 


aac 


C-LL 


gga 


gca 


aca 


cct 


gag 


caa 


gec 


aaa 


eg a 


cac 


aaa 


gat 


get 


y lo 


gaa 


gec 


t- -h r* 
LLC 


LLC 


gga 


gga 


get 


gga 


d Ly 


aaa 


LdL 


crqt 


qtg 


gaa 


QL L 


yd u 


tgg 


cct 


gca 


t a t 
La L 


a i-t- 

u L L 


ycta 


yy d 


+" rf rr 

u y g 


^ ^ j=i 

ddd 


p Ci 


t trr 
l ty 


get 


dO L 


y ct l 


y cio 


f f a 
l l y 


gag 


aaa 


tac 


y 00 


A P\ Pt 

dd d 


aac 


gaa 


cca 


acC 


etc 


ate 


cgC 


ata 


tgg 


ggt 


gat 


get 


ttg 


ttt 


gat 


ate 


gtt 


gac 


aaa 


gat 


caa 


aat 


gga 


get 


att 


aca 


ctg 


gat 


gaa 


tgg 


aaa 


gca 


tac 


ace 


aaa 


get 


get 


ggt 


ate 


ate 


caa 


tea 


tea 


gaa 


gat 


tgc 


gag 


gaa 


aca 


ttc 


aga 


gtq 


tgc 


gat 


att 


gat 


gaa 


agt 


gga 


caa 


etc 


gat 


gtt 


gat 


gag 


atg 


aca 


aga 


caG 


cat 


CtG 


gga 


ttt 


tgg 


tac 


ace 


atg 


gat 


cct 


get 


tgc 


gaa 


aag 


etc 


tac 


ggt 


gga 


get 


gtc 


ccc 


[SEQ ID 


NO: 


12] 







DNA sequence of GFP-aeguorin linkers 

pGA (strain 12507) TCC GGC CTC AGA TCT [SEQ TD NO: 13] 

pGIA (strain 12508) TCC GGC GGG AGC GGA TCC GGC GGC CAG TCC 

GGC CTC AGA TCT [SEQ ID NO: 14] 



pG2A 


(strain 


12509) 


TCC 


GGC 


GGG AGC 


GGA 


TCC 


GGC 


GGC 


CAG 


TCC 








GGC 


GGG 


AGC GGA 


TCC 


GGC 


GGC 


CAG 


TCC 


GGC CTC 








AGA 


TCT 


[SEQ ID 


NO: 


15] 










pG4A 


(strain 


12510) 


TCC 


GGC 


GGG AGC 


GGA 


TCC 


GGC 


GGC 


CAG 


TCC 








GGC 


GGG 


AGC GGA 


TCC 


GGC 


GGC 


CAG 


TCC 


GGC GGG 








AGC 


GGA 


TCC GGC 


GGC 


CAG 


TCC 


GGC 


GGG 


AGC GGA 








TCC 


GGC 


GGC CAG 


TCC 


GGC 


CTC 


AGA 


TCT 


[SEQ ID 



NO: 16] 



pG5A (strain 12511) 



TCC 


GGC 


GGG 


AGC 


GGA 


TCC 


GGC 


GGC 


CAG 


TCC 




GGC 


GGG 


AGC 


GGA 


TCC 


GGC 


GGC 


CAG 


TCC 


GGC 


GGG 


AGC 


GGA 


TCC 


GGC 


GGC 


CAG 


TCC 


GGC 


GGG 


AGC 


GGA 


TCC 


GGC 


GGC 


CAG 


TCC 


GGC 


GGG 


AGC 


GGA 


TCC 


GGC 


GGC 


CAG 


TCC 


GGC 


CTC 


AGA 


TCT 










[SEQ ID 


NO: 


17] 

















pSeG5A (strain 12512) and pStG5A (strain 12513) same linker 
sequence than pG5A. 



Peptide sequence of linkers 

pGA Ser Gly Leu Arg Ser [SEQ ID NO: 18] 

Pgla Ser Gly Gly Ser Gly Ser Gly Gly Gin Ser Gly Leu Arg Ser 
[SEQ ID NO: 19] 



pG2A 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 




Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Leu 


Arg 


Ser 


[SEQ ID 


NO: 


20] 




pG4A 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 




Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 




Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Leu 


Arg 


Ser 






[SEQ ID 


NO: 


21] 






















pG5A 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 




Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 




Gly 


Gly 


Ser 


Gly 


Ser 


Gly 


Gly 


Gin 


Ser 


Gly 


Gly 


Ser 


Gly 


Ser 




Gly 


Gly 


Gin 


Ser 


Gly 


Leu 


Arg 


Ser 


[SEQ ID 


NO: 


22] 







pSeG5A and pStGSA idem than pG5A. 
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WHAT IS CLAIMED IS: 

1 . A composition comprising a purified polypeptide, wherein said composition 
has the functional characteristics to bind calcium ions and to permit a measureable energy, 
said energy depending of the quantity of calcium bound and of the quantity of polypeptides in 
said composition in absence of any light excitation. 

2. A purified polypeptide having the amino acid sequence of SEQ ID NO: 1 . 

3. A purified polypeptide having the amino acid sequence of SEQ ID NO: 2. 

4. A purified polypeptide having the amino acid sequence of SEQ ID NO: 3. 

5. A purified polypeptide having the amino acid sequence of SEQ ID NO: 4. 

6. A purified polypeptide having the amino acid sequence of SEQ ID NO: 5. 

7. A purified polypeptide having the amino acid sequence of SEQ ID NO: 6. 

8. A purified polynucleotide having the sequence of SEQ ID NO: 7. 

9. A purified polynucleotide having the sequence of SEQ ID NO: 8. 

10. A purified polynucleotide having the sequence of SEQ ID NO: 9. 

11. A purified polynucleotide having the sequence of SEQ ID NO: 1 0. 

12. A purified polynucleotide having the sequence of SEQ ID NO: 1 1 . 

13. A purified polynucleotide having the sequence of SEQ ID NO: 1 2. 

14. A polynucleotide linker having the polynucleotide sequence of SEQ ID No: 13. 

15. A polynucleotide linker having the polynucleotide sequence of SEQ ID No: 14. 

16. A polynucleotide linker having the polynucleotide sequence of SEQ ID No: 1 5. 

1 7. A polynucleotide linker having the polynucleotide sequence of SEQ ID No: 1 6. 

18. A polynucleotide linker having the polynucleotide sequence of SEQ ID No: 17. 

19. A polynucleotide linker according to any one of claims 14 to 18 having the 
function after translation to approach a donor site to an acceptor site in optimal conditions to 
permit a direct transfer of energy by Chemiluminescence Resonance Energy Transfer (CRET) 
in a purified polypeptide according to claim 1 . 

20. A peptidic linker of at least 5 amino acids and comprising the amino acid 
sequence of SEQ ID No: 18. 

21 . A peptidic linker of at least 5 amino acids and comprising the amino acid 
sequence of SEQ ID No: 19. 

22. A peptidic linker of at least 5 amino acids and comprising the amino acid 
sequence of SEQ ID No: 20. 

23. A peptidic linker of at least 5 amino acids and comprising the amino acid 
sequence of SEQ ID No: 21. 
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24. A peptidic linker of at least 5 amino acids and comprising the amino acid 
sequence of SEQ ID No: 22. 

25. A peptide linker having the function to approach a donor site to an acceptor site 
in optimal conditions to permit a direct transfer of energy by chemiluminescence in a purified 
polypeptide according to claims 2 to 7. 

26. A peptide linker according to any one of claims 20 to 25, having the function to 
approach a donor site to an acceptor site in optimal conditions to permit a direct transfer of 
energy in the presence of a purified polypeptide according to claim 1. 

27. A peptide linker according to any one of claims 20 to 26, which has the 
capacity to stabilize a modified bioluminescent system in vivo and/or in vitro. 

28. A modified bioluminescent system comprising two bioluminescent proteins 
and a peptide linker according to any one of claims 20 to 27. 

29. A modified bioluminescent system according to claim 28, wherein said two 
bioluminescent proteins comprise at least an aequorin protein. 

30. A modified bioluminescent system according to claims 28 or 29 comprising the 
following constituents : aequorin protein and a GFP protein. 

31. A kit for measuring the transfer of energy in vivo or in vitro and containing at 
least one of the polypeptides according to claims 2 to 7 or the polynucleotide according to 
claims 8 to 13 and the reagents necessary for visualizing or detecting the said transfer in 
presence or in absence of a molecule of interest. 

32. A fusion protein of the formula: 
GFP - LINKER - AEQ; 

wherein GFP is green fluorescent protein; 

AEQ is aequorin; and 

LINKER is a polypeptide of 4-63 amino acids. 

33. The fusion protein as claimed in claim 32, wherein the linker comprises 14-50 
amino acids. 

34. The fusion protein as claimed in claims 32 and 33, wherein the linker 
comprises the following amino acids: 

(Gly Gly Ser Gly Ser Gly Gly Gin Ser [SEQ ID NO: 251]) n , 
wherein n is 1-5. 

35. The fusion protein as claimed in claim 34, wherein n is 1 . 

36. The fusion protein as claimed in claim 34, wherein n is 5. 
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37. A fusion protein for energy transfer from aequorin to green fluorescent protein 
by Chemiluminescence Resonance Energy Transfer (CRET) following activation of the 
aequorin in the presence of Ca^, wherein the fusion protein has the formula: 

GFP - LINKER - AEQ; 
wherein GFP is green fluorescent protein; 
AEQ is aequorin; and 

LINKER comprises the following amino acids: 
(Gly Gly Ser Gly Ser Gly Gly Gin Ser [SEQ ID NO: 25]) n , 
wherein n is 1-5; and 

wherein the fusion protein has an affinity for Ca^ ions and a half-life of at least 24 

hours. 

38. A fusion protein as claimed in claims 32 to 37, wherein the linker includes the 
amino acid sequence Ser Gly Leu Arg Ser [SEQ ID NO: 26]. 

39. A fusion protein as claimed in claims 32 to 38, which further comprises a 
peptide signal sequence for targeting the fusion protein to a cell or to a subcellular 
compartment. 

40. A polynucleotide encoding a fusion protein as claimed in any one of claims 32 

to 39. 

41. A composition according to claim 1, wherein said purified polypeptide is a 
purified polypeptide according to any one of claims 2 to 7, or a modified bioluminescent 
system according to claims 28 to 30, or a fusion protein according to any one of claims 32 to 
39. 

42. A culture as deposited at the C.N.C.M. and containing the plasmid No. 1-2507. 

43. A culture as deposited at the C.N.C.M. and containing the plasmid No. 1-2508. 

44. A culture as deposited at the C.N.C.M. and containing the plasmid No. 1-2509. 

45. A culture as deposited at the C.N.C.M. and containing the plasmid No. 1-25 1 0. 

46. A culture as deposited at the C.N.C.M. and containing the plasmid No. 1-25 1 1 . 

47. A culture as deposited at the C.N.C.M. and containing the plasmid No. 1-2512. 

48. A culture as deposited at the C.N.C.M. and containing the plasmid No. 1-25 1 3 . 

49. A method of screening in vivo a change in a physical, chemical, biochemical or 
biological, condition, the method comprising the steps of: 

a) administering to a mammal a composition according to claim 1 or 41 ; 

b) detecting whether the light is produced; and 

c) optionally measuring the ionic concentration of calcium flux. 
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50. A method of screening in vitro a change in a physical, chemical, biochemical, 
or biological condition, wherein the method comprises: 

(a) adding into a reaction system a composition according to claim 1 or 41 
containing an analyte of interest in presence or in absence of a 
molecule of interest to be tested; and 

(b) visualising the emission of energy produced in step (a). 

51. A method of screening of a product leading to a change in a physical, chemical, 
biochemical or biological condition in vivo, wherein the method comprises: 

(a) administering to a vertebrate a pharmaceutically acceptable medium 
comprising a composition according to claim 1 or 41 in presence or in 
absence of a molecule of interest to be tested; 

(b) detecting the energy produced in presence of said composition; and 

(c) optionally, measuring the effective concentration of said molecule of 
interest necessary for the detection of the energy in step (b). 

52. A method of screening in vitro a molecule capable of modulating the energy in 
a composition according to claim 1 or 41, wherein the method comprises: 

(a) providing in a biological sample a composition according to claim 1 or 
41 in a reaction system containing the molecule to be tested; 

(b) detecting a modulation of the energy by comparison with a control 
sample containing said composition according to claim 1 or 41 without 
the molecule to be tested; and 

(c) optionally, determining the effective minimal concentration of said 
molecule capable of inhibiting or increasing the energy transfer of said 
composition. 
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Fluorescence and Ca2+-induced bioluminescent activity 
in dissociated neurons in culture infected with adenoviral-G5A vectors. 
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in dissociated neurons in culture infected with adenoviral-SG5A vectors. 
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